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(57) Abstract 

The present invention provides methods and compositions for identifying inhibitors of the interaction between phosphopeptidc binding 
pairs. U., a protein domain having at least one phosphopeptidc binding domain and the phosphoiylated ligands that bind these domains. 
These inhibitors may be used for pharmaceutical compositions and in therapeutic treatments for diseases in which a phosphopeptidc domain 
binding is implicated. 
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CELL-BASED ASSAY 

Technical Field of the Invention 

Background of the Invention 

Cellular signal transduction, i.e.. the series of events leading from extracellular events to 

iTein t^^^ °' ''^•^ disease is Nlero^ 

non?j^L.n / •^^"^''"'='"9 '"^'^"'es have been identified, including receptor and 

aZme? ?Ll'r'"'. T"'- P^°^P^«»^««« ^"'l molecules with enzymatic or regulator^ 
actMt.es These molecules generally demonstrate the capacity to associate specifically with other 
protems to form a signaling complex that can altar cell activity 

m«rt...^'^?K"r^-'"°*^'"^ °' conserved sequence which sen/e as non-catalytic 

modules that direct protein-protein interactions during signal transduction. One such Z^n is he 

oroteinr^o'' T ' '"""^ combinations and locations in Sent 

p oteins. For example, some members of the Src-tamily of tyrosine kinases, e.g., Abl. GRB2 and 
mK. each contain one SH2 domain. Other tyrosine kinases, such as ZAP and S^k contain ^o SH2 

20 Xi:yo::i^re"r?cu:nt'"^'^ ^ — ---^^ °' P-"- 

«.ootS^.^ domains direct the association of specific proteins or protein domains by binding 
selectively and with specificity to protein sequences or motifs containing phospholyrosine For 
example, upon ligand binding, the PDGF beta-receptor dimerizes and autophosphorJSs mlple 
risers ([Hh"'- P^.°^.P^°^.'f °" °' tyrosine-containing protein motifs withSI the receptor 
^nfrZ nA^r SH2-containing proteins such as c-src. PLC-gamma. PI3K 

and ras-GAP. fomi.ng a signaling complex. Other examples include the binding of the tandem SH2- 
r^rin nr^ t^ J° tyros.ne-phosphorylated motifs on the beta or gamma subunits of the IgE 

nhocnh^' ! . . °' SH2-containing protein ZAP-70 with tyrosine- 

phosphory ated motifs on subunits of the T cell receptor. Other protein receptors for tyrosine- 
Somi;^rrP?RT'^'" T'^T and include the so-called phospholyrosine binding 

S i! . °[ Phosphotyrosme interaction domains ("PIDs"). Many signaling pathways 

co^^ininn nZT " '"""'^^^^ '^^^ '""'^'"3 °' a phosphotyrosine- 

containing protein or protein domain with an SH2 or other protein receptor for a tyrosine- 
phosphorylated domain. *^ lyrosme 

Pharmaceutical agents which interfere with the formation or stability of such signaling 

complexes may be used for precise intervention in these complex biological processes fn o Ser to 
treat or prevent the diseases or pathological effects mediated by such signaling However one 

ruTsoecif!?" TT* """f " ^"^^"'"^ Pham,aceu.ically useful compounds tha 
disrupt specific cellular events is deciphering their mechanism and specificity of action Tvoicallv 

^Je^rsre^V >; n"o?mr ' '''T' "^""'^ "^^^^"^"^ °' tesf compords oTan^^^' 

■mfrlf T« fK T ! ^' ^^^^^ downstream of a protein-protein interaction of particular 

rretiL oJ'^H ' T*^""' '"^y '''''' °" """'ber of intermediate steps, 

the results of such expenments can be difficult to interpret 

247 ntamt^^H '"««^action assay described by Song and Relds. Nature. 340:245- 

t , .o f ^ ^ "^^"^ *° interactions of a variety of molecules See also Fields 17 

at. US Patent No. 5.283.173 (1 Feb 1994). The two-hybrid assay is baseron the oSeivSion that 
ranscnp lon factors contain separable functional modules that direct either DNTb^dTng o 
transcrip tion activation. A DNA binding domain expressed in cells will bind to DNA but n?t activate 
transcnption as ,t lacks a transcription activation domain. Conversely, a transcript! acSon 
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domain alone will not affect transcription in the absence of directed and/or intimate interaction 
with DNA such as would be provided by a DNA-binding domain. However, if the DNA binding domain 
and the transcription activation domains are each expressed as part of separate fusion proteins, and 
the fusion proteins are capable of associating, the Iwo-hybrid" complex so fonned represents a 
5 reconstituted transcription factor (see FIG. 1). Such a reconstituted transcription factor is capable 
of initiating transcription of a reporter gene (e.g., a gene for a conveniently detectable marker 
such as beta-galactosidase or alkaline phosphatase (SEAP) or a protein important for cell 
viability) located downstream of DNA binding sites recognized by the DNA-binding domain. The 
amount of reporter gene expression, i.e., the amount of gene product produced, will reflect the 
1 0 extent to which the fusion proteins complex with one another. Compounds that block the association 
of the fusion proteins will reduce reporter gene expression. 

The two-hybrid assay approach has been used to identity presumed SH2-dependent protein- 
protein interactions using yeast (Xing, Z. et al.. Mo\. Biol. Cell, 5:413-421 (1994) and Osborne, 
IWI.A. et al., Biotechnol., 13:1474-1478 (1995)J. In those experiments, protein binding pairs, 

1 5 rather than inhibitors of such binding, were identified. The two hybrid approach has also been used 

to detect the inhibition of certain protein-protein interactions in yeast, but not for inhibition of 
protein-protein interactions involving phosphorylated ligands and their receptors [Chaudhuri, B. 
et al.. FEES Lett., 357: 221-226 (1995)). While a yeast-based two-hybrid approach may be 
useful for other purposes, it would not lend itself to screening for drugs to inhibit phosphopeptide- 

2 0 mediated interactions in mammalian cells since the use of yeast cells would limit the researcher, 

or at least the "hits", to the subset of compounds which are able to penetrate the yeast cell wall. 
This artificial limitation would incorporate into the experimental design the risk of missing 
important new compounds capable of blocking key signaling interactions in mammalian cells, but 
which are unable to penetrate yeast cells. 

2 5 There thus remains a need in the art for improved methods and compositions for identifying 

inhibitors of phosphopeptide-mediated protein-protein binding in a wide variety of host cells. 

Summary of the Invention 

The present invention addresses this need by providing a mechanism-based in vivo assay that 

3 0 allows the identification of inhibitory compounds that target key molecules involved in specific 

signal transduction pathways. 

The invention provides a cell containing, and capable of expressing, recombinant DNA 
molecules encoding a pair of fusion proteins. One DNA molecule encodes a transcription activating 
fusion protein which comprises one or more transcription activation domains (TADs) and at least 

3 5 one member of a phosphopeptide binding pair, i.e., a phosphopeptide binding domain or peptide 

ligand therefor. Another DNA molecule encodes a DNA-binding fusion protein which comprises one 
or more DNA-binding domains (DBDs) and the other member of the phosphopeptide binding pair. 
The cell further contains a reporter gene which is linked to a DNA sequence to which the DNA- 
binding fusion protein is capable of binding. The reporter gene encodes a detectable gene product 

4 0 which is expressed when transcription of the reporter gene is activated, e.g. following association 

of the two fusion proteins to reconstitute an effective transcription factor. The engineered cell 
further contains a protein kinase capable of phosphorytating one or more of the tyrosine residues 
of the peptide ligand as is required for two-hybrid formation. The kinase may be endogenous to the 
cell or may be the gene product of an introduced, heterologous gene. In one embodiment one of the 
4 5 fusion proteins contains a kinase domain and thus provides its own kinase function. One or both of 
the fusion proteins may optionally contain a nuclear hormone-binding domain (HBD) permitting 
regulated sequestering or inactivation of the fusion protein. In such embodiments, addition of the 
hormone renders the HBD-bearing fusion protein available for two-hybrid formation. When the 
cell expresses the two fusion proteins, and the peptide ligand domain has become tyrosine 
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A wide variety of test compositions can be assayed in accordance witti ttiis invention, 
including, e.g., microbial broths, cellular extracts, conditioned media from cell lin s or from host 
cells transformed with genetic libraries, collections of synthetic compounds, combinatorial 
libraries or other compounds or mixtures from synthetic programs based on conventional 
5 medicinal chemistry approaches or structure-based drug design. 

This invention thus provides a means for identifying selective inhibitors of phosphopeptide 
binding interactions. As noted at the outset, phosphopeptide binding domains are present in a wide 
variety of proteins, including proteins involved in intracellular signal transduction pathways 
relevant to a number of important normal and disease processes. Accordingly, inhibitors identified 
1 0 through this invention may be useful for a variety of purposes. First, they may be useful as 

biological reagents in assays as described herein for functional classification of a phosphopeptide 
domain or phosphopeptide receptor of a particular protein, particularly a newly discovered 
protein. Families or classes of such proteins may thus be defined functionally, with respect to 
binding specificity. 

1 5 Moreover, inhibitory agents of this invention can be used to inhibit the occurrence of 

biological events resulting from molecular interactions mediated by a phosphopeptide binding 
domain. This invention thus provides a method and reagents for inhibiting (totally or partially) the 
interaction between a protein containing a phosphopeptide binding domain and a natural ligand 
thereto (i.e.. a protein which normally binds in a cell to the SH2. PID or other phosphopeptide 

2 0 binding protein) or a biological activity mediated by such interaction. In a research context, such 

inhibition can be used to study the biological role of the signaling event and the cell and molecular 
biology of the affected signal transduction pathway. 

An inhibitor identified by the method of this invention can be formulated into a 
pharmaceutical composition containing a pharmaceutically acceptable carrier and/or other 

2 5 excipient(s) using conventional materials and means. Such a composition can be administered to an 

animal, either human or non-human, for prevention or treatment of a disease or condition 
resulting from cellular events involving a phosphopeptide/PBD-mediated protein-protein 
interaction. Administration of such composition may be by any conventional route (parenteral, 
oral, inhalation, and the like) using appropriate formulations as are well known in this art. The 

3 0 inhibitor of this invention can be employed in admixture with conventional excipients, i.e., 

pharmaceutically acceptable organic or inorganic carrier substances suitable for parenteral 
administration. Inhibitors first identified by these methods, pharmaceutical compositions 
containing such inhibitors, and pharmaceutical methods involving administration of such 
inhibitors to patients for the treatment or prevention of diseases mediated by the binding 

3 5 interactions of naturally occurring phosphopeptide binding pairs is therefore an important object 

of this invention. 

Other aspects and advantages of the present invention are described further in the following 
detailed description of the preferred embodiments thereof. 

4 0 Brief Description of the Drawings 

FIG. 1 depicts various assay configurations of this invention. 

FIG. 2 is a bar graph depicting the results of a zeta (Q ITAM-tandem ZAP SH2-dependent 
two-hybrid formation in mammalian cells. Secreted alkaline phosphatase (SEAP) activity (RU) is 
plotted against cells which received the following DNAs, which correspond to the columns in the 
4 5 grapn: (1) carrier DNA, (2) pMSZ (Example ID), (3) pMSZ and pMAZ22 (Example IE), (4) 
pMS (GAL4 DNA binding domain-vSrc kinase), and (5) pMS and pMAZ22. 

FIG. 3 illustrates an estrogen-regulated assay system. The bar graph depicts the results 
observed in the presence (+) and absence (-) of estrogen on Zeta ITAIy^-tandem ZAP SH2 dependent 
two-hybrid formation in mammalian G5IL-2 HT1GB0 cells. SEAP activity is plotted against cells 
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Which received plasmid DMAs in duplicate, i.e.. cols. 1 and 2 are carrier DNA; cols. 3 and 4 are 
?0 aS SlrL?nd%V^I' ''^"^^ "'^^ ' ' ""^''^ ^^^'"'^^ ^ 

<; ^'^^l" °* ^ two-hybrid assay plotting SEAP activity units vs. four stable 

5 Clones A through D. each grown with (dark bar) or without (light bar) inducer (estrogen) 
• ^ K f^*fl ^'^^ '^"^ """^ '^""'^^ °* SEAP production (RU) in Clone D with 

'^^a^^V l^T'^T"" SH2-dependent two-hybrid activity vs. tirne following ^uogen 
addition to media from 0 minutes to 24 hours. The graph also illustrates that parent cells 
(unengineered G5IL-2 HT1080 cells) do not produce SEAP when cultured in the presence (.E) or 

1 0 absence (-) of estrogen. 

. K ^!.^. ? ^ specific Beta ITAM-Src SH2-dependent 

two-hybrid formation in mammalian cells transfected with plasmid DMAs, which correspond to the 

pMSBand"pMA^r^' ^^^^ ^^''^"""^ ^^^^ ^""^ ^^^^^ (Bxample 1N): or (3) 

^ ^ <;hp HoSinH '"t^'^^'.^K .'"""'"^ ^^^^ ^'^"^"y regulation of Beta ITAM-Src 

SH2-dependent two-hybnd formation in mammalian cells. The presence (-.) or absence (-) of 

Sl'ii wn?'^, ^«P^«s«"t cells transfected in duplicate with pMerSB 

(Example 1L) [Cols. 1 and 2] or with pMerSB and plVIAS2 (Example IN) {Cols 3 and 4] 

2 0 .n ind^ Shi! r f I T ^^'^^ ^'""''y °' ^ ^'^""^ mammalian cell line containing 
2 0 an inducible GAL4-estrogen receptor l.b.d.-VP16 transcription activator in the presence (+) or 

absence (-) of estrogen. ' 

ciuo nl^' I '1 f "^^[S^^PV'""'"^ ^^^^ ^""^^'^ "♦^osen regulation of Beta ITAM-Fyn 

SH2-dependerit two-hybrid formation in mammalian cells observed by transient transfection ( FIG 
9. panel A columns 1-4 of the graph) or when using a stable cell line (FIG 9, panel B. column 5 

2 5 and 6). The presence (+) or absence (-) of estrogen is indicated. The columns represent cells 

r^r«mnin1m fn^'^^l^ '^"'.f o^^^^ ^^''^'"^'^ ^ 2) or With plWerSB and pMAF2 

(Example IP [Cols. 3 and 4). Columns 5 and 6 represent a stable cell line (prepared as described 
in example 2 containing both pMerSB and pH/IAF2 cultured in the absence (column 5) or presence 
(column 6) of estrogen. 

30 

Detailed Description of the Invention 

mprhJIl!m ^^'^'f^^^ ^""-^^^ "«eds in the art of drug discovery by providing a 

mechanism-based. cellular two-hybnd assay for identifying inhibitors of key molecules involved 

3 5 unkntr.? P^j^l^^ys- This assay permits the detemiination of whether a test or 

nrn^inr "P^'« °' "'"^'^'^S « ^^^^'^^^ phosphopeptide-mediated association, and 

?r?nv«;rnT ''^^ r '^^"^'^'^^ tunctlonaHy characterizing such inhibitors. 

JlA or n^hll nH T >^f. '''t"'^ °' '"^'"''"^^ °' association between an SH2. 

ZZr P^^S"""^"*'*^! "'"^'"^ °' = 9iven protein and a naturally occurring ligand 

4 0 lomliJn ' ^° <»^«'"selves useful in therapeutic and pharmaceutical 
4 0 compositions and regimens. 

p..r,n?^°"^ n*^®' features this invention provides genetically engineered cells, preferably 
.nhihi7o ,K *°J "'^^ <»^^o"9»^P"t identification of compounds which interfere with. 

rnJnS -^^T^ .mpede interactions mediated by the binding of a tyrosine-phosphorylated 
exoresstS f P^°fP^°P«P»«^« ^""^'"9 Protein. The cells of this invention contain and are^able of 
3Z?nH '^ H ^t? ^."19^"^^ «"~*"9 ^° '"sion proteins. One of the fusion proteins 
^o7« n, ^^''^?'^? K ^""^'"s « "9a"d for that PBD. The two fusion 

i;;rr?p;;rorr rtri^^^^^^^^ ^^^^ °' -'--^ - 
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One or both of the fusion proteins may also contain one or mor optional domains or elem nts, 
mcluding a domain capable of regulatably sequestering the fusion protein at a desir d cellular 
location or compartment or in an inactive conformation. 

In the practice of this invention, cells containing the components mentioned above are 
5 cultured under suitable conditions and in a suitable culture medium permitting cell growth and a 
detectable level of expression of the reporter gene. A test substance is added to the medium, the 
cells are cultured for an appropriate incubation period, i.e. permitting an inhibitor to enter cells, 
block two-hybrid formation or persistence and effect a detectable change in reporter gene 
expression. The level of expression of the reporter gene is then measured. A decrease in the level of 
1 0 expression of the reporter gene indicates that the test substance is a candidate inhibitor. 

The assay may be repeated using a control cell containing the reporter gene and its associated 
control elements and further containing and being capable of expressing a gene encoding a 
transcription factor fusion protein as a positive control. The transcription factor fusion protein 
contains both a DNA binding domain and a transcriptional activating domain permitting it to actuate 

1 5 transcription of the reporter gene. The DBD and TAD are typically the same domains which are 

separately present in the respective DNA-binding and transcriptional activating fusion proteins, 
previously described. Alternatively, the control cell may express in place of the transcription 
factor fusion protein, two altemative two-hybrid fusion proteins which were designed based on a 
different protein binding pair than the phosphopeptide binding pair represented by by the primary 

2 0 two-hybrid assay cells as described above. The failure of the test substance to significantly alter 

the level of expression of the reporter gene in such controls supports the conclusion that the test 
substance is not generally inhibiting the assay or inhibiting the assay read-out via an irrelevant 
mechanism. Such controls help confirm the phosphopeptide binding inhibitor status of candidate 
inhibitors. 

25 

I. Definitions 

To facilitate understanding of this invention, the following definitions are provided: 
A "peptide ligand" as used herein defines a polypeptide motif or fragment thereof which 
contains one or more tyrosine residues, and which is capable of binding to a phosphopeptide binding 

3 0 domain such as an SH2 or PID domain when at least one of the peptide ligand's tyrosine residues is 

phosphorylated. A peptide ligand may be an oligopeptide, polypeptide or protein fragment or 
portion. (The terms "polypeptide" and "protein" are used interchangeably herein.) 

A 'phosphopeptide binding domain" (PBD) is an amino acid sequence or polypeptide region, 
typically present in certain signaling proteins, which is capable of selective and specific binding to 

3 5 a characteristic tyrosine-phosphorylated ligand. The binding of a tyrosine-phosphorylated ligand 

to a PBD is considered to mediate or direct the association of the proteins and the transduction of the 
intracellular signal. 

A "phosphopeptide binding pair" (PBP) consists of a PBD and a peptide ligand therefor. 

An "SH2 domain" is an example of a PBD. An •SH2 ligand" is an example of a peptide ligand. 
40 A "phosphopeptide binding pair inhibitor" Is a compound which can be shown to to inhibit the 

fonnation or persistence of binding between a PBD and a tyrosine-phosphorylated peptide ligand 
therefor. A phosphopeptide binding pair inhibitor may function by binding to a PBD with 
competitive avidity vis-a-vis a tyrosine-phosphorylated peptide ligand for the PBD. binding to a 
tyrosine-phosphorylated peptide ligand for the PBD with competitive avidity vis-a-vis the PBD. 

4 5 or othenwise disrupting the nonnal binding between a PBD and a tyrosine-phosphorylated peptide 

ligand. 

A "DNA Binding Domain" (DBD) is a polypeptide or protein domain, e.g. a portion of a 
transcription factor, which binds to a DNA sequence, typically a transcription regulatory DNA 
element linked to a target or reporter gene. 
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A "Transcription Activator Domain" fTAD^ » r«^lw«»«.•^ 
of a transcription factor, which when ii nraSiZ f 'f' ^P^P*"^^ °' P'°<«'" domain, e.g. a portion 

target gene, activates gene trL^criptirn ST^^^^^^^^ ^^9"'^^°^ D^'^ of a 

sequence). scnpiion (and is typically linked to a nuclear localization 

tZzx2ltj::^, ref~nscS:r ^ t^^^'^ — 

upstream of a target or reporter gen^ which bind, fo fn no^ sequence, typically 

to a. lirc^er : VsT: ^ ~r °' o, a OBD fused 

protein composed ol^at least o"e copy of a ^AD fused ?oa^ r T''"''" ^ 
PBP. '^'^ o' a I Au lused to at least one copy of the other member of a 

• I. The Engineered Cells of the Invention 

A The Host Cells 

or a.^::^^:::^^ "ed^v— herver"°^°^^^ -^-^^ 

However, mammalian cells are DreseniiS nrl ! T this invention, 

those of mouse, hamster rat rabbTd^^ Z o '"='"^6. without limitation. 

heterologous DNA comnrisinn a r»norto, « ^''P^®®*'°" o' the fusion proteins as well as 

responsL to the ChyZ" ompt ^fThf^ °' ^ 

activator fusion protein ^ DNA-bind.ng fusion protein with the transcription 

DNAs or as one or 1" DNAs X .^^^^^^^^^^^^^ '"^"'"'^^^ - -P-ate 

S. Reporter Gene Constructs 

a,. - — --^ 

reaction resulting in a detectable color rhTnn! encoding an enzyme which catalyzes a 

viability in the presencroran^herLis^^^^^^^^^ P"^"^''^ 9ro^ or 

a repressor or suppresLr ofa bScai funSion "L L ''''^"'^ °* «^««"«a' ""trient. 

expression of the "reporter" aene or ^m J^ ^ ^'^'"'^'^^ °^ enhanced by the reduced 

Examples of rjo^^ g^es wwTrSl! T""' conveniently detected, 

without limitation, genes encodTng XmTs Ll^^^^^ °' 
phosphatase and horseradish oeroxid^I r^ . I"c.ferase. beta-galactosidase. alkaline 
generating readily d^tlSS ZTc s^A^^n^^^^^^^^ -^^>V- actions 

detectable cell surface marker such as Thvi T 1 ^' ^^.^J^Porte^ 9ene may encode a readily 
convenient, and preferabrc"r^«^^^^^ P^^^f " - for which the^e is a 

exemplified herein, is the gene encoding sec eted afkalinp Tl *"'°ZV^P°^^' ^ene. which is 
Gene. 66:1-10 (1988)] secreted alkaline phosphatase (SEAP). [Berger. J., et al., 
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The production of SEAP may be monitored from the culture media without the need to lyse the 
ce Is. This permits conv ni nt detection, using a conventional colorim trie or fluorescent assay at 
different time points during the assays of the invention. 

The reporter gene is linked to a DNA element recognized by the DBD and permitting qene 
5 expression in response to two hybrid formation (i.e., complexation of the first and second fusion 
proteins). The DNA element may comprise a synthetic promoter sequence consisting of multiple 
DBD binding sites upstream of a TATA box element. The number of such binding sites can be 
adiusted to optimize the overall level of reporter gene expression induced by the fusion proteins of 
this invention. Examples of such binding sites include GAL4 binding sites (see e.g Acheson "DNA 

1 0 Tumor Viruses". Cold Spring Harbor. NY. (Tooze. J. ed.. 1980) pp 151-160: Peden et al.' Science 

209:1392-1396 (1980): Liu & Green, Cell. 61:1217-1224 (1990); and Ptashne & Gann 
Nature. 346:329-331 (1990)]. 

The DNA element of the reporter gene, when introduced into cells, will preferably not be 
occupied in the absence of exogenously added DBD of the second fusion protein. The extent of 
occupancy of the DNA element may be determined by measuring the level of reporter gene 
transcnption in the absence and presence of the the fusion proteins, the kinase domain or the 
transcription factor fusion protein. 

By way of illustration, one particular embodiment of the invention uses a reporter gene 
construct containing DNA encoding SEAP linked to a DNA element containing iterated GAL4 binding 
sites. Additional guidance in the use of suitable promoters for this reporter gene may be obtained 
from Fields et al. US Patent No. 5.283.173 (1 Feb 1994) and Vasavada et al, Proc. Natl. Acad. Sci 
USA. 88:10686-10690 (1991). 

C. Fusion Protein Constructs 

2 5 The two fusion proteins of this invention typically contain either (a) one or more DNA 

binding domains ("DBDs". which may be the same or diHerent), or (b) one or more transcription 
activation domains (TADs". which may be the same or different), in addition to one or more 
phosphopeptide binding domains or peptide ligands therefor. For instance, in one embodiment, the 
first fusion protein has an SH2 domain or a tandem SH2 region (two SH2 domains) and a 

3 0 transcription activation domain, while the second fusion protein has a DNA binding domain and an 

SH2 ligand domain, capable after tyrosine phosphorylation, of binding with the SH2 domain(s): 

1st fusion protein: SH2 domain-TAD 

3 5 •2d fusion protein: DBD-SH2 ligand 

In preferred embodiments, the second fusion protein contains a hormone ligand binding domain and 
a protein tyrosine kinase domain linking the DBD and SH2 ligand domain, as discussed in further 
detail below. 

40 The DBD and the TAD of the two fusion proteins typically originate from a transcription 

factor. The DBDs and TADs may be derived from the same or different transcription factors and may 
be optimized by genetic engineering. DBDs and TADs are often derived from transcription factors 
having separable DNA-binding and transcription activation domains. 

4 5 i. Transcription Factors 

A large number of transcription factors are known which require two subunits for activity, 
that is, a single transcription factor can be divided into two separate functional domains (e.g. a TAD 
and a DBD). While each domain is inactive (transcriptionally) by itself, the paired components, 
when in close proximity, comprise an active transcription factor. 
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Transcription factors which can be used in this inv ntion include yeast GAL4 which can be 

5 283 17?.° On° ^^1' (1987) and US PatenT No 

5 283.173J. One may use. for instance, a fusion of GAL4(1-147)-SNF1 and 

p^rt^rdr/j^^^^^^ 

Each Of these proteins will have a linKed parer'pa"' oTIn rSgTa^te?^^^^^^^ 
(binding partner recognition) will bring the TAD (VP16) to DNA (GAL4 or Hfip 1 TnH 
transcription Will be induced. Other transcription factors\hat ^y be useS f^^^^^^^^ of this 

Scti Soi'"hn?3 th -'/-♦"««<^ tt^e -embers of the Jun, Fos. and ATF/CREB fam t 
h« . 1 !'u'^ '^"P*"' superfamily, and the like. Other TADs and DBDs which mav 

be used in the practice of this invention are well known in the art ^ 
The transcnption factor can be endogenous or exogenous with respect to the desired host cells 
If the transcription factors are exogenous, but functionafwithin the host and 0^00X^1 wit^^^^^^^ 
endogenous RNA polymerase (rather than requiring an exogenous RNA polymerase fofwh ch a oeJ 

^^tarL"''"fT^' '''k element functional wrth trOBDs o T^Ds of the 

transcnption factor can be provided with another construct for regulating transcription of the 

tTeTo^^no^s p omore:;:oif " °' T'""'°'' the' ene r-oc^a^^^^^ 

ine exogenous promoter region, i.e.. the reporter gene(s). 

2 0 ii. DNA Binding Domains 

hindin?')!?™ domains for use in this invention may be selected from a wide variety of DNA- 
cS^r^th. r ""7" ^"^'""'^^ DNA-binding domains of the homeodomar 

class the zinc-finger class, and the paired-box class, for which numerous exampteTare knTwn in 

DNA rav2^ °" °' the protein wl 

substl" ons '''"'"'"y °' ""^y engineered by amino acid 

A preferred feature for DNA binding domains is the lack of interactions with other cellular 
t'heTe'IL^ k'' T'' °' b« known with precirn so ht 

dor^irralTonTh'nf; k'T '"'^"^ ^^''^ substitutions within the DNA-binding 

?NTSnla e Ih ^ h'"' J"'""'" "'"'^^"'"^ ''^ «yP« utilized. The 

undulv^^n Z!h ^ -ntroduced into cells, will preferably not be occupied (at least not 
mav be dlrmS^^^ exogenously added DNA binding domain. The level of background occupancy 
may be determined through appropnate controls as discussed previously 

Cell-type specific factors (not found in the cells used for the in vivo assay) DNA bindina 

(GruT^Iberq D A NatSn ^^^f "'''"'^'"^ ^^"^-^ - derived from the Phoxl proteir 

h««!^ ^' • Alexandre. C. and Oilman. M. 2. (1992). Human and Drosoohila 

?089-forLnh?n': "''"'^^ DNA-binding activity of serL reUnse fa'or. sZS 25^ 

class A 69 2ml acldlT°"H""'"'r; "^'^ « '"^'"^^^ the homeodornain 

rlrnnnL ^^o-ain derived from Phoxl is sufficient to bind DNA. High-affinity DNA 

tha^Coe 'tfnr' v'™'"'" '^^'^ -'"ino add subs uSLns 

that change its DNA recognition specificity and that affect its ability to interact with certain 
endogenous proteins in human cells. (Grueneberg et al) 

eJr»^ ' T ^' ^*P'®^s'°" '^'on'ig ot a novel zinc-finger protein that binds to the c-fos 

serurn response element. l^ol.Cell. Biol. ,2. 2432-2443; Genbank acces^on number M88579) 

Anv ofrntr,""'"; °' ^'^^ °* ^'"^-^'"^^^ P^°»«'"=- •« ^as seven tandem zinc finger 
Se^e i £na T ^°'"^'"f °^ combination of two or more domains can be used to 
generate a DNA-b.nding domain with novel recognition specificity. Furthermore, because the 
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general structure and nnode of DNA rec gnition is known for protein$ of this class, DNA recognrtion 
can be directly modifi d if necessary. 

A third suitable DNA-binding domain is the GAL4 binding domain referred to previously. 

A fourth suitable type of DNA-binding domain is a composite DNA binding domain as described 
5 In published' International Patent Application No. WO 96/20951 and exemplified by ZFHD1. See 
also Pomeranz et al. 1995, Science 267:93-96. 

I i i . Transcription Activation Domains 

Preferred transcription activation domains are compact in size, potent in activity, and 
1 0 nontoxic to cells. Additional features become relevant lor particular applications. For example, for 
constructs to be used in one particular type of cell or tissue, activator domains with especially high 
activity in the host cell are used. In applications that involve the stable integration of the target 
gene into host cell DNA. activator domains that resist chromatin suppression are used. 

Numerous activation domains are known in the art. Two illustrative activation domains which 

1 5 may be used int he practice of this nvention are the VP16 activation domain and the NF-kB p65 

activation domain. See e.g. International Patent Application Nos. WO 96/20951 and WO 96/41865. 

iv. Phosphopeptide Binding Pairs 

Each of the fusion proteins contains either at least one TAD or at least one DBD linked to at 

2 0 least one member of a PBP, i.e. to at least one phophopeptide binding domain or peptide ligand 

therefor. One exemplary PBP is an SH2 domain and a peptide ligand therefor (e.g., a ZAP SH2 
domain and an immunoreceptor activation motif such as the TCR zeta ITAM). As noted previously, 
the ligand domain will contain one or more Tyr residues, which must be phosphorylated to bind to 
the SH2 domain. By way of further example, the peptide ligand domain may be an IgE ITAM, the 

2 5 polyoma middle T phosphorylation sequence or a synthetic consensus tyrosine phosphorylation 

binding site. 

In some embodiments, inclusion of protein domains that are not generally regarded as 
influencing, either allosterically or otherwise, the behavior of the SH2 or PID or other such 
domain may be included to stabilize the folding, enhance expression or provide a means of 

3 0 identifying or manipulating the protein (e.g. fusion to glutathione-S-transferase or other domains 

that aid in identification/purification, epitope tags, etc). 

As illustrated below, the fusion proteins may contain a plurality of PBDs or ligands 
therefore. For example, one emlxjdiment of the present invention uses a fusion protein containing a 
tandem SH2 domain from ZAP-70 (containing two SH2 domains). 

3 5 Peptide ligand(s) members of the PBP for use in the fusion proteins are defined above. For 

example, where one member of a binding pair member is an SH2 domain, the other member, i.e . 
the peptide ligand, may be a naturally occurring ITAM containing at least one Tyr. Peptide ligands 
in their nonphosphorylated state will not bind the PBDs. These ligands must be phosphorylated by 
endogenous or (introduced) hetterologous protein tyrosine kinase activity within the cell. 

4 0 A large number of SH2 domains, PIDs, and other phosphopeptide binding domains, as well as 

ligands therefor, are known in the art and may be adapted for use in this invention. The following 
additional background information and guidance may be of interest and use to the practitioner. 

Identification of SH2 or SH2-like Domains 
4 5 The term "SH2-like domain or a subdomain thereof" refers to a sequence which is 

substantially homologous to a Src homology region 2 (SH2 region), or a subdomain of an SH region 
preferably a conserved region of an SH region. The Src homology region is a noncatalytic domain of 
-100 amino acids which was originally identified in the viral Fps and viral Src cytoplasmic 
tyrosine kinases by virtue of its effects on both catalytic activity and substrate phosphorylation (T. 
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Pawson. Oncogene 3. 491 (1988) and I. Sadowski t al.. Mol. Cell. Biol. 6, 4396 (1986)) SH2 
domains have been found in a variety of eukaryotic proteins, some of which function in 
intracellular signal transduction. Many are known inthe art. Examples (including counterparts 
from vanous species) of SH2 domain-containing proteins include (1) members of the src-family 
protein tyrosine kinases (Src. Lyn. Fyn. Lck. Hck. Fgr. Yes). (2) She (3) Tsk. (4) Btk (5) VAV 
(6) Grb2. (7) Crk. and (8) signal transducer and transcription (STAT) proteins. In addition a 
number of proteins, such as ZAP-70. p85 phosphatidylinositol 3' kinase (PI3K) Syk GTPase 
Activating Protein (GAP), and Phospholipase C gamma, have two SH2 domains. SH2 domain- 
containing proteins have been identified in human, rodent, sheep, bovine, C. elegans Drosophila 
Xenopus, flatworm. freshwater sponge, and hydra. 

One way to identify new SH2 or SH2-like domains from unknown DNA. RNA or protein 
sequence is by using one of many available computer alignment programs. One example is pfscan 
which can be run via the World Wide Web (WWW) site at 

http://ulrec3.unil.ch/software/pro(ilescan.html. To use the program, a protein sequence is tested 
against a "profile" describing the SH2 domain motif. According to the program information the 
particular strength of profiles is that they can be used to describe very divergent protein motifs 
These profiles are nomiaily derived from multiple alignments of the initial sequence set. In 
addition to the sequences themselves, a profile identifies which types of residues are allowed at 
what position within the domain, which amino acids are conserved, which ones are not which 
positions or regions can allow insertions, and which regions may be dispensable. Additional 
information on Rscan and PROStTE can be obtained at the web page 

http://ulrec3.unil.ch/index.html operated by the Bioinformatics Group at the ISREC (Swiss 
Institute for Experimental Cancer Research). 

As an example we analyzed the peptide sequence of human Src with the pfscan program. The 
results are shown below. The program clearly identified the SH2 domain of Src as encompassing the 
region from ammo acids 150-247 of the Src peptide sequence. In addition, the SH3 and kinase 
dentins were identified by pfscan. 

NScore raw from-to Profile I Description 

26.9695 1792 pos. 150-247 PS50001 | SH2 Src horology 2 (SH2) dotain 

f^'foVc I ^ Srchatology3 (SH3) doimn 

43.4246 2912 pos. 269-522 PS50011 | PROIEIN_KINASE_IXM Protein kii^e 

The NScore of a match is the negative decadic logarithm of the expected number of matches of the 
given quality (or better) in a random database of the given size. For NScores «1 this converges to 
the probability of finding the match in the database. Since the number of expected matches depends 
on the size of the database, the decadic logarithm of the database size must be subtracted before the 
calculation: 



-log(NExp) = NScore - log (DBsize) 

where (NExp=Expected number of chance matches) and (DBsize=size of the database in 
characters). 

. e ''".^^J°"°'^'"9 ^able gives somes examples on how to convert the NScores into probabilities for 
the SwissProt database and the nonredundant (nr) protein database. The calculation is based on a 
database size of 



18,531,385 residues for SwissProt (log=7.27) 
58.154,119 residues for the nr database (log=7.76) 
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Expected chance matches in: 
NScore SwissProt nonrecaLindant 
7.0 1.8 5.8 

5 7.5 0.58 1.82 

8.0 0.18 0.58 

8.5 0.058 0.182 

9.0 0.018 0.058 

9.5 0.006 0.0182 

10 10.0 0.0018 0.0058 

10.5 0.0006 0.0018 

...and soon... 

The segment of a test sequence contains an SH2 domain with an SH2 profile NScore value > 7 5 

1 5 preferably > 8. more preferably > 9, more preferably > 10. 

As a second example, the N-tenninal 160 amino acid sequence from human ZAP-70 was 
applied to ptscan. The result indicated an SH2 domain bounded by amino acids 10-102. 

NScore raw fran-to Profile | Description 

2 0 16.4402 1082 pes. 10-102 PS50001 | SH2 Src hoiology 2 {SH2) datain 

The minimal segment encoding an SH2 or SH2-like domain as detemiined by sequence 
alignment may not be enough to function or function optimally in the assay. Additional sequences 
either N-tenninal and/or C-terminal of this minimal segment may be necessary or desirable for 

2 5 optimizing the binding of the encoded protein to a peptide ligand within the context of the invention. 

These additional sequences may be derived from the natural protein sequence or may be derived 
from other proteins or even non-natural sequences that are added to the expression vector. 
However, the sequence required can be readily detemiined biochemically, in an in vitro binding 
assay, or genetically, using the production of SEAP as an indicator of SH2 domain-phosphopeptide 

3 0 interaction. 

SH2 domains can be identified using other computer alignment programs, such as MegAlign 
within the DNAstar computer package (Madison. Wl). To do this, one or more known SH2 domains 
and a test sequence are aligned by the clustal method. A sequence having > 25%, in some cases 30 - 
50 %. in other cases > 50%, amino acids identical to a known SH2 domain is identified as an SH2 

3 5 homology domain. The positions of identical amino acids between the test sequence and different 

known SH2 domains can vary, except for one position. All SH2 domains identified to date have a 
conserved arginine residue approximately 25-40 residues from the start of the SH2 homology 
domain. In human src this arginine is found within the sequence FLVRES, where abbreviations for 
the amino acid residues are: F. Phe; L. Leu; V. Val; R. Arg; E. Glu; S. Ser. 

4 0 Another way to identify SH2 or SH2-like domains is by running a query in the federated 

nucleotide or protein databases for the SH2 domain feature. In the SWISS-PROT database, this is 
listed under the FT or feature" heading. SWISS-PROT database can be accessed over the WWW at 
EBI http://www.ebi.ac.uk. For example, in the file listed for human Src (PI 2931). the region 
containing the SH2 domain is shown to be 150-247 
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SOTSS-PROT: P12931 
ID SRC_HUMftN SEMCARD; 
P12931; 

MIM; 190090; -. 
roOSITE; PS00107 
PRDSriE; 
PRDSriE; 
PRDSITE; 
EROSITE; 



AC 
ER 
ER 

m 
m 

ER 
ER 
DR 
ER 
KW 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



PRT; 535 AA. 



PRCTEHlt.KlNASELATP . 
H^DTEim_KINASE_TYR . 
SH2. 
SH3. 

PROriEIN_KINASE_DCM . 
PRODCM [tonain structure / List of seq. sharing at least 1 donainl 
SWISS-2DPAGE; GET REGICN CN 2D PAGE dotmnj 

MfRISIYLATICN; SH3 DCMAIN; SH2 DCWAIN. 



PS00109; 
PSSOOOl; 
PS50002; 
PS50011,- 



ATP-BINDING 

LIPID 
DCMAIN 
DCMAIN 
DCMAIN 
NP_B1ND 
BI^IDI^G 
ACT_SITE 
^DD_RES 
MDD_RES 



83 
150 
269 
275 
297 
388 
419 
529 



0 
1 

144 
247 
522 
283 
297 
388 
419 
529 



BV SIMrLARFIY. 
MVRISIATE (BY SIMnARTIY) 
SH3. 
SH2. 

PROTEIN KINASE. 
ATP (BY S3HIIARITY) . 
ATP (BY SIMILARnY) . 
BY SIMILARITY. 

FHOSEHDRYLATICN (AUID-) (BY SIMILARnY) . 
HiaSFIOCifLATICW (BY SIMHARTTY) . 



CDNA exarr.ln , "^f "'"V SH2 or SH2-like domains may be accomplished by screening a 

SnL for lHTnrl n ^ ''':w'P'°'^l!L^^ """"'^^ "9^"'* ^ SH2 domain .o isolate 

probe. The SH2 domain or protein containing the SH2 domain may be isolated from naturally 
erkrotic'ciSr ^"t ""T- "^^^""^ recombinantly in bacteria, yeas, or 

sianific?nt hom^rIilfn°' ^tK ""cuf? ""^^ ''^ P~9^3^ 

D oarams Th?^' ^""^ sequences to be detected by computer alignment 

sS^e as ^^oL%HP h"' ^'^h'bit the same or similar three-dimensional 

Dhorohotvro^nr . ^" SH2-like domain and function to bind 

kJZ iH^Tm I °' "^^^ three-dimensional structure of several 

sZs cor^oosTd o,% r« f T'"o'- '"'""'"^ Characterized as two anti-parallel beta 

lifh!i moT IL ® "^9'°"* '°''"'"9 an alpha helix may or may not be present 

^ulre whTrnlvr. ""T'"' ^^'^^^"'"'^ •^^^'"g SH2'like dom'a" 

stmmre bv hint V;'"^ crystallography or NMR spectroscopy. Alternatively, a predicted 

like domaS. ""^^ *° '""""'^ ^ as an SH2- 

http7/ulJLltnT.h/nIf^H '.'r7f ^ "''^ to generate the SH2 profile for pfscan. as taken from 
h tD7/lf^^l^ r/'^ <P^°'"« "matrix can be obtained from 

SnJ^^^ nc , ^ ''^'"'''"'^^-^''^''^^"2) "ased on alignment of approximately 390 SH2 

Sm"em Tn 'thTswisI ProrOatr"- f ' °' '°"^^-"9 SHs'domSnl Ld in the 

Dlba:rA;ersron nTmbe" " ""'^'"''^ ''''''' ^wiss-Pro. 
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P00519, 
P00522, 
P00525, 
P00528, 
P00542, 
P03949, 
P06239, 
P07332, 
P08103, 
P08631, 
P09851, 
P10936, 
P13116, 
P14085, 
P15054, 
P16333, 
P16885, 
P19174, 
P23726, 
P24604, 
P26450, 
P27870, 
P29350, 
P29354, 

2 5 P32577, 

P35991, 
P41239, 
P41242, 
P42224, 

3 0 P42227, 

P42230, 
P42G79, 
P42682, 
P42685, 

3 5 P42688, 

P43403, 
P46108, 
Q02977, 
Q04736, 

4 0 006124, 

Q07883, 



ABL1_HIMAN 
ABLJOCME 
SRCJWISR 
SRC1_ERCME 
FE5_FSVGA 
ABL1J2AEEL 

FES_HUMRN 

HZK_HUMAN 
GrPA_BCVIN 
VES_XENLA 
SRC2_XENLA 
SRC_AVISr 
SRC_AVIS2 
^CKJ^UMRN 
PIP5_ilLMAN 
PIP4_HUMMSI 
P85B_BCIV3N 
TEIQJOJSE 
P85A_MUSE 
VAV_fOJSE 
PIN6_H[MAN 
G3RB2_JiLIMAN 
CS5<^RAT 

CSK_CHICK 
CIKJCUSE 
SIAIJHUMAN 

SrA5_ftXJSE 
CIKJCMAN 

SRK2_SP0LA 
ZA70jiUMRN 
CRKJttJMRN 

VESJEUSE 
PINB_HCMVN 
GRB2_CHICK 



P00520, 
P00523, 
P00526, 
P00530, 
P00543, 
P05433, 
P06240, 
P07947, 
P08487, 
P09324, 
P10447, 
P12931, 
P13406, 
P14234, 
P15498, 
P16591, 
P17713 , 
P20936, 
P23727, 
P25020, 
P27446, 
P27986, 
P29351, 
P29355, 
P34265, 
P39688, 
P41240, 
P41243, 
P42225, 
P42228, 
P42231, 
P42680, 
P42683, 
P42686, 
P42689, 
P43404, 
P46109, 
Q03525, 
Q04929, 
Q06187, 
Q08012, 



ABI^JECJSE 

SRCjCmCK 

SRC_PSVP 

FPS_niJSV 

FESJFSVSr 

GAQCJVVISC 

I£IUCUSE 

PIP4_BC(VIN 

YES_CHICK 

ABL_FSVHY 

SRQJiUffiN 

FYN JOILA 

FGRJsXXJSE 

VAVJiUMAN 

FERJUMRN 

GrPA_HUMAN 

P85A_BCIVIN 

SRC_PSVH1 

FXN JOPHE 

P85AJUMAN 

P1N6 _MXJSE 

SEt6_CAEEL 

VKFljCflEEL 

FnqjCUSE 

CSK_HLWAN 

SIM_^CUSE 

STA4_fOUSE 

SIA5_SHEEP 

TBCJiLWRN 

LOeCHICK 

SRK1_SPQIA 

SRK3_SPQLA 

ZA70_MX]SE 

CRKL<JiJMAN 

rnucusE 

cRKjancK 

BIKJHLMRN 
ERKJCIOIE 



P00521, 

P00524, 

P00527, 

P00541, 

P00544, 

P05480, 

P06241, 

P07948, 

P08630, 

P09769, 

P10686, 

P13115, 

P14084, 

P14238, 

P16277, 

P16879, 

P18105, 

P23615, 

P24135, 

P25911, 

P27447, 

P29349, 

P29353, 

P31693, 

P35235, 

P40763, 

P41241, 

P41499, 

P42226, 

P42229, 

P42232, 

P42681, 

P42684, 

P42687, 

P42690, 

P43405, 

Q00655, 

Q04205, 

Q05876, 

Q07014, 

Q08881, 



AHL JJLVKB 
SRC_PSVSR 

FPS_AVISP 

SFiCN JCUSE 

FYIOOIAN 

LYNJOCVN 

SRC2_P04E 

FGR_HJMRN 

PIP4_PAT 

SRC1_;{ENLA 

SRCJWISS 

FES_FE3JCA 

FES_fCUSE 

FPS_ERCME 

SPr6_YEASr 

PIP5_PAT 

LVNjyDUSE 

YES_XrFHE 

CSWJ30ffi 

SRC_RSVPA 

PIt©_MXJSE 

SrA3_HUMfiN 

CSK^fDUSE 

PITIB_RAT 

SEA2_HUMAN 

SrA5_HUMRN 

SlABjyDUSE 

ABL2_HUMfiN 
SEKlJiXJGfn 
SRK4_SP0LA 

SYK_PIG 

TINS_CHIC2C 

F5nH_CHICK 

UK HUMAN 



A general method to identity an SH2 domain within a test peptide or nucleotide sequence follows: 

4 5 1 . Translate the cDNA or RNA into single letter code protein sequence. This could be accomplished 
using a computer program such as DNA strider or EditSeq in the DNAstar package. 



2. Go to the WWW site at http://ulrec3.unil.ch/software/profilescan.html 
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3. Copy the test sequence into th appropriate box in the pfscan form 

4. Submit the f rm to the pfscan server 

5. The results are sent back through the web browser or via e-mail. 

The suljject invention is relevant to SH2 and SH2-like domains as described in the foreaoina 
paragraphs. Usmg mfomiation provided herein, and by analogy to the examples provided bZ one 

roen?d7Ha;lH^H "".'"^ ^"^'like domain. PID or PID-Uke domarand 

a peptide l.gand therefor, e.g. in place of ZAP. Syk. Src or Fyn SH2 domains. 

Identification of PID or PID-iike Domains 

An alternative phosphotyrosine binding domain to SH2 domains is the so-called 
phosphotyrosine interaction domain (PID). This domain, containing on average about 160 amino 

ilnT.« r- ^""^ ^° SH2 domains, which 

rewgnize sequences having a consensus pTyr-Xaa-Xaa-Xaa-Xaa (a phosphotyrosine followed by 

re^evan to p!d 1h Pi^ 11 """I ''l^^^' ""^^ '^^'^'^"'^^^ ^PP"«=««an is also 

ISlIpH r„?K! ! '=°'^'"9 3 P'D domain is 

substituted n the appropnate vector for the SH2 domain coding sequence and a ligand that 

h« XTn. K J ""^'2 "9^"^- Phosphorylation of the PID ligand could 

Shn.^ho^?f ?.? T^r'^'^.' '^""'''"^ P^°»«'" kinases could be used to 

of th. P?n'^o^"' '"'°'^^"?" ^"'"9 do'^ains is known in the art. A detailed description 
K r ,K ^ «f' ''^ °" a< »he site http://www.bork.embl- 

heidelberg.de/Modules/pid-gif.html. The following information is taken from that site: 

Documentation - PROSITE description 

Beside SH2.the phosphotyrosine interaction domain (PI domain or PID)I3] is the second 
nSn^'Tr?'"'^'"^ in the transfomiing protein She [1.2). She couples 

mirmllr'' . '"'^T' '° ^ P^'^^^y '^^^ ^^e proliferation ^ 

^nasTs ?h/S r ""9^.;.P^^'P^«« transforming activity of oncogenic tyrosine 

t^^osinenhncnh ??T ^P"""^^"^ ^-nds to the Asn-Pro-Xaa-Tyr(p) motif found in many 
trSh:'^ rof . f'T'."^ 3'°^^^ ^«'=«P«°^=- ^^^s also been found in 

Ire lifted below ' " ^^^ulatory proteins 13) which 

* liT"^"^^ ^i'' ^"^^ ^"'^ ''^ one N-terminal PID. a collagen-like 
domain and a C-temiinal SH2 domain. «-uiiayBn ukg 

Human She related protein Sck contains one PI domain and a SH2 domain 

* Mamrnalian X1 1 is expressed prominently in the nervous system. It contains 2 disc 
homologous regions (DHR) of about 100 AA downstream of the PID 

* o3^Lrr'^^H '^^^"^^ 'n determination of cell fate during sensory 
organ formation in drosophiJa embryos. It has one PID 
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• Rat FE65. The WW domain as well as the 2 PlDs found in the sequence of FE65 indicate that 

this protein is probably involved in signal transduction 
. Drc^ophila protein disabled is a cytoplasmic, tyrosine phosphorylated protein found in 

CNS axons and body wall muscles. It is involved in embryonic neural development It 

contains one N-terminal PI domain. 

Mouse mitogen responsive phosphoprotein isoforms P96. P93 and P67 which are produced 
by alternative splicing, contain one N-terminal PID. This is also true for the differentially 
expressed human ortholog Doc-2. 
Human EST05045 protein fragment has one PID. 

References: 

I 1J Kavanaugh WM.. Williams L.T. Science 266:1862-1865(1994) 
{ 2] Blaiki, P. et a!.. J.Biol.Chem. 269, 32031-32034 (1994) 
( 3] Bork P., Margolis B. Cell 80. 693 (1995) 



A PI domain alignment based on approximately 40 PI domains from various species is illustrated in 
the WWW site at http://ulrec3.unil.ch/prf_details/alignments/PID.msf. An alignment based on 
approximately 50 PID sequences is shown at the web site at http7/www.bork embl- 
heidelberg.de/lulodules/pi-ali.html. 
2 0 The minimal segment encoding a PID or PID-like domain as detemiined by sequence alignment 

may not be enough to function or function optimally in the assay. Additional sequences either N- 
terminal and/or C-terminal of this minimal segment may be necessary or desirable for optimizing 
the binding of the encoded protein to a peptide ligand within the context of the invention. These 
additional sequences may be derived from the natural protein sequence or may be derived from 

2 5 other proteins or even non-natural sequences that are added to the expression vector. However, the 

sequence required can be determined biochemically, in an in vitro binding assay, or genetically, 
using the production of SEAP as an indicator of PID domain-phosphopeptide interaction. 

V. Optional Domains 

3 0 One or both of the fusion proteins may also contain one or more optional domains, including a 

protein kinase domain, as mentioned above, or a domain capable of regulatably rendering the fusion 
protein unavailable for 2-hybrid formation (e.g. by sequestering the fusion protein at a desired 
cellular location or compartment or by maintaining the fusion protein in an inactive 
conformation). 

3 5 Kinase Domains, As one example, if the cell does not have an endogenous protein kinase 

capable of phosphorylating the tyrosine residues of the peptide ligand for the SH2, PID or other 
phosphopeptide binding domain, a heterologous kinase activity is introduced into the cell. A DNA 
encoding such a heterologous kinase may be introduced into the cell as an independent transcription 
unit (i.e., with its own transcription regulatory elements, such as a promoter and/or enhancer 

4 0 sequence). Altematively. a kinase activity may be introduced as a functional kinase domain within 

one of the two fusion proteins of this invention. In one embodiment the fusion proteins comprise a 
first chimera containing one or more SH2 domains linked to one or more transcription activation 
domains and a second chimera containing a DBD-kinase domain-pep de ligand domain fusion. 
In one embodiment, the protein kinase domain from the v-Src rotein was used to 
4 5 phosphorylate the peptide ligand linked to the DNA binding domain. Alternative protein kinases may 
be used to conduct this phosphorylation. The recognition of protein kinase domains can be conducted 
by the same approaches described lor SH2 or PID domains using pfscan, ClustalV analysis, 
hybridization, or functional biochemical analysis. A segment defining the protein kinase homology 
region is approximately 280 amino acids. A detailed alignment of protein kinase domains can be 
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Ob amed at the pfscan web site listed above or at the protein kinase web site at University of 
Wnase'dom'Iin ac Sf"" .(»^ttP7/www.sdsc.edu/kinas s/). The minimal segment encoding^a protein 
.en^l»? ''^f ^"snment is bounded by a conserved glycine I the N- 

T M f ^"'^ ^ conserved arginine at the C-temiinal portion. However, additional sequences 
thir hT'"' °* '"'^ '"^y be necessary 0 de ir^Se for 

nva^Z HHr P»'Osphorylate the peptide ligand within the cLtexl o fe 

Z I d Jr^H , ?h" """^"^^^ '"^y '^^ '^-^ ♦^^^ P^°t«'" kinase sequence or 

^T' °' non-natural sequences that are added to thVexpression 
vector^ The sequence required can be readily detemiined biochemically, in an in vitro 
KSfn nH "«*"9 the production of SEAP as an indicator of SH2 (or 

doma^'" ^" appropriate protein kinase would be substituted for the v-Src 

cont»in?f»""I'°" As another example, fusion proteins of this invention may also 

contain a targeting sequence providing for translocation of the protein to the nucleus Sudh a 
ZTl sequence may have a plurality of basic amino acids, referred to as a bipartite basic repeat 
(see. e.g.. Garcia-Bustos et al. B.ochem Rinnh»c ^rtf I-TI (1991)1 This sequence can 

appear in any portion of the molecule internal or proximaWthe N- o C-terminu and resSsTn 
localization of the fusion protein within the nucleus 

thpr^for^n^r binding Domains. Compounds introduced into cells may be labile and 

T^llnZ ^'^Tl °' °' ""'^^ ^"-^ cases, it wotJd 

comlrS w ?H ? ^^'"P^"^"^^ P^«s«n' the cell at the time the cells are exposed to test 
hnZn! r H !• H^'T '""''^"'^•^ '"s'°" P^°'«'"« containing a nuclear 

wh^h ma?h« N"'^^-"- homione receptors and ligands are known 

Which may be used in the practice of this invention, including estrogen, glucocorticoid retinoic 

Sino dnr"^p"' "TT ? '."'T''- ■'•^^ properties of a nuclear hormone ligand 

biding domain (P.card et al. Cell (1988) 54. 1073-1080; Eilers et al. Nature (1989) 340 
n^rition"'' the post-translational regulation of interaction of the fusion proteins. By way of 
o fhtTr;. ^^'^S^ .^^^f ^" autonomous regulatory domain: upon fusion 
LtrorTr^°"'.': ^"''^^^ts Chimeric protein to Somional 

In J P™P««y due to the hormone-reversable interactions of the HBD with 

fu^ln nrl'' K ' ^^^^ein and a DBD-HBD-kinase-PBD ligand 

LdTnr r '^H J' ^« '«P°"«^ 9ene construct. Only with the 

the HBolnZfn 'T" "^^""^ ^'"^ '"'^^^'^t- °' h°^-°ne, 

prLin toT^Te 't::^;S;rom^:er " '-^^^^^ complementary fusion 

r«rnnn^r ^T^V'"^^ ''^ '"^^'^^^^ '^"^i" '"^ion proteins include an antibody 

c^^conZTh'l r'"'"'"^ "^'"9 ^"''b°^'"- Such monitoring can be 

useful to confimi that the fusion protein is made and that the level of expression is not affected bv 
compounds tested for inhibition of phosphopeptide domain binding. ^ 

Gi;.*^!!.!'"*!!'' °^ constructs, Reporter Gene Plasmids and 

Genetically Engineered Cells of the Invention 

can be^ntro'^dTnt^t^^^ ^"^ ""'^'"'"9 9«"«= °t this invention 

can be ntroduced into the cells as one or more DNA molecules or constructs, in many cases in 

Snst';;?ct:s") TheT "^T^ ^^■^'^^^^ °' '^-^ ""^ -^-^ -"aln the 

^"structs can be prepared in conventional ways, where the coding sequences and 

Z^^ldV"":' '""^ «PP~P"«te. ligated. cloned in an appr^ donTng "05^ 

analyzed by restnction or sequencing, or other convenient means. Particularly using PGR 
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individual fragments including all or portions of a functional unit may be isolated, wfi re one or 
more mutations may be introduced using "primer repair", ligation, in vitro mutag nesis etc as 
appropriate. For example, a DNA sequence encoding the DNA-binding domain or TAD(s) is joined to 
DNA encoding ttie appropriate member of the PBP. These sequences are joined such that they 
5 constitute a single open reading frame that can be translated in cells into a single polypeptide 
harbonng all intended domains. The order and arrangement of the domains within the polypeptide 
can vary. The construct(s) once completed and demonstrated to have the appropriate sequences may 
then be introduced into a host cell by any convenient or desired means. The constructs may be 
incorporated into vectors capable of episomal replication (e.g. BPV or EBV vectors) or into vectors 
1 0 designed for integration into the host cells' chromosomes. The constructs may be integrated and 
packaged into non-replicating, defective viral genomes like Adenovirus. Adeno-associated virus 
(AAV), or Herpes simplex virus (HSV) or others, including retroviral vectors, for infection or 
transduction into cells. Alternatively, the construct may be introduced by protoplast fusion 
electroporation. biolistics. calcium phosphate transfection. lipofection. microinjection of DNA or 

1 5 the like. The host cells will in some cases be grown and expanded in culture before introduction of 

the construct(s). followed by the appropriate treatment for introduction of the construct(s) and 
integration of the construct(s). It may be most convenient to use simple transfection procedures 
The cells will then be expanded and screened by virtue of a marker present in the constructs 
Various markers which may be used successfully include hprt. neomycin resistance, thymidine 

2 0 kinase, hygromycin resistance, etc.. and various cell-surface markers such as Tac CD8 CD3 

Thyl and the NGF receptor. 

In some instances, one may have a target site for homologous recombination, where it is 
desired that a construct be integrated at a particular locus. For example, one can delete and/or 
replace an endogenous gene (at the same tocus or elsewhere) with a recombinant target construct of 

2 5 this invention. For hiomologous recombination, one may generally use either £1 or O-vectors See 

for example, Thomas and Capecchi, Cell (1987) 51. 503-512; Mansour, et al.. Nature (1988) ' 
336. 348-352; and Joyner, et al.. Nature (1989) 338. 153-156. 

The constructs may be introduced as a single DNA molecule containing all of the genes, or 
different DNA molecules having one or more genes. The constructs may be introduced 

3 0 simultaneously or consecutively, each with the same or different markers. 

Vectors containing useful elements such as bacterial or yeast origins of replication, 
selectable and/or amplifiable markers, promoter/enhancer elements for expression in 
procaryotes or eucaryotes. and mammalian expression control elements, etc. which may be used to 
prepare stocks of construct DNAs and for carrying out transfections are well known in the art, and 

3 5 many are commercially available. 

Additional background information and general guidance to the practitioner with respect to 
design, assembly, incorporation into plasmids. and transfection of constructs for such fusion 
proteins and reporter genes is available in the following published international patent 
applciatlons: WO 94/18317. WO 95/02684. WO 96/20951. WO 95/24419 and WO 96/41865, 

4 0 the contents of which are incorporated herein by reference. 

IV. The Assay Method of the Invention 

Genetically engineered cells containing the various DNA constructs described above may b-n 
used to carry out the assay method of this invention. Such assays permit one to identify the 
4 5 presence in a test composition of an inhibitor of the binding of a PBP with a tyrosine- 
phosphorylated ligand therefore. Test compositions or compounds to be assessed for their 
inhibitory activity can be obtained from a variety of sources, including, for example, microbial 
broths, cellular extracts, conditioned media from cells, synthetic compounds and combinatorial 
hbranes. and may be tested individually or in pools. The assay method of this invention may be 
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A 77je General Method 



^oconltsToZrot^:2VZ:^^^ 'T^^' P-'^-'-'V a celi in which ,he 

or heterologous kinifcap^ e o? ^roJl oh 'k °' '^^l' '=°"^"^ endogenous 

1 0 method, thf aboveTe:S?l; L^;^^^^^^^^^^ rr .'^11^ "^"'^^'"^ '^^^^'^ 

medium under suitable conditions permitting Sov^h o .^^^^ ^ =°"^«"«°"al culture 

the test composition For examole tha cpIHc I Presence and absence I 

the drugs neLsarJ to selertTor ce Is^^^^^^^ f ""''"^ '^"""^^ '"^'^'^ <=°"«a'"'"9 

cells m% be cultuTe^i^lli: d Le cultu^^^^^^^^^^^ ZIT ''"T 

1 5 other vessels or arrays of vessels as desJed multidishes and m.crowell plates, or in 

cells ^is'u::r;tz:^:^:::::£:;::^ r t ^"^^^ °" 

solution. The compounds to be tesL SI b« rp , „ i °' °' ^ ''^'^""'^ ^^It 

Alternatively, it may brnecSsarv to S tS f" ^"'^ '^'^^'^^'^ 

2 0 tetanolysin or anotLr cerpermeaSizirTg Tgtt ' '""''"'^ "'"^ °' 

eel, is cSrofet^^^^^^^^ .'^^ °' - inhibitor test composition, the 

srs^^cr!~-^^^ 

gene jroducrwSSilhld'in'.r' "'''^'"'"'"^ whether production of the detectable 

ceTor medl are elaluate^^^^^ °' ^O'^POsition. After transcription is induced 

PBP m^^^^^^^^^^ -bility Of the 

the text composition will be about equJ^lent s^ncfthe r«!. f , ^"^ °' 

results Obtained with no test compound as described fb^ve .f L° ^ ^'"^i'ar to 

inhibitor Of the PBP interaction, a decrease in thetveUeoort^^ " ' 

decrease indicates that test comoound inhihitl Vh. ! expression is obsen^ed. This 

Ugand. and thus J^ZlVZ'Zl^ T,'^'2^^^^^^^^ '° phosphory.ated peptide 
expression is thus extinouished or redurprf rifl two-hybnd complex. The receptor gene 
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TCR zeta ITAM in vivo (s Example 3). Compounds that block ZAP SH2 binding to ITAM sequence 
are detected using a first fusion protein comprising the vSrc kinase domain, estrogen receptor 
domain, and the TCR zeta ITAtVI ligand fused to a DNA-binding domain and a second fusion protein 
comprising the tandem SH2 domain of ZAP fused to a TAD. The Interaction of the ZAP SH2 domains 
5 with the zeta ITAM domain brings the activator domain in proximity to the DNA element, inducing 
reporter gene transcription. Compounds that block ZAP SH2- zeta ITAM interaction reduce 
reporter gene expression. 

In this embodiment, and by way of illustration, the assay method of this invention offers 
several advantages over traditional T cell assays. The assay readout is directly dependent on the 
1 0 interaction between the particular SH2 ligand and the SH2 domain-containing polypeptide (e.g., the 
ZAP-70 tandem SH2 domain). In contrast, in standard T cell assays that measure cell 
proliferation, cytolytic activity or cytokine production, there are many steps between activation 
and the assay endpoints. Therefore, many nonspecific inhibitors score in standard T cell assays, 
which will not score in the assay of this invention. 

1 5 

B. EUminating Non-Specific Inhibition 

To exclude test compounds that inhibit reporter gene expression by nonspecific mechanisms 
(e.g.. by having a general inhibitory effect on cellular transcription or translation), test 
compounds are also assayed for their effect on reporter gene expression when reporter gene 

2 0 transcription is not dependent on PBP interactions. For this purpose, an indicator cell line is 

utilized wherein the transcription activation domain is brought directly to DNA by the DBD (by 
expressing in cells a DBD/transcription activation domain fusion). To rule out false positives based 
on othenvise nonrelevant effects of a test composition relating to the HBO, the fusion protein may 
further contain a hormone binding domain (HBD). In such cases, the only difference between the 

2 5 positive control indicator cell line and the assay cell line is the dependence of the assay cell line 

upon PBP complex formation for reporter gene expression. 

General inhibitory effects of transcription are also revealed by performing radiolabeled 
uridine pulse labeiings during exposure to the test compound. Inhibition of general incorporation of 
radiolabel indicates a general inhibitory effect on transcription. 

30 

C Simultaneous Exposure of the Cell to the Assay Components 

In another aspect of this method, the first fusion protein transfected into the cell contains an 
HBD, as described above. The HBD is an important feature of this aspect of the method of use of this 
cell. Test compounds introduced into cells may be labile and therefore not present for long periods 

3 5 of time once introduced into the cell. It would be advantageous to have all interacting components 

present in the cell at the time the cells are exposed to potential Inhibitors of the PBP interaction. 

Inhibitors of phosphopeptide binding protein-protein interactions could exert their effect in 
two ways. They could compete with the ligand/HBD/TAD domain for the PBD or displace the 
ligand/HBD/ TAD, which has complexed with the PBD. Mechanistically, it will be easier to block 

4 0 than displace the interaction between the binding domain and its ligand. Potential inhibitors, i.e., 

test compounds, can be introduced into cells prior to complex formation (hormone addition) to 
block complex formation. 

D. Cell-Free Assay 

4 5 In another embodiment of this method, cell-free transcription systems prepared from cells 

as described above or prepared from the purified components described above may be used in lieu of 
the erKfineered cells. In a cell free implementation, peptide ligands may be synthetic or 
nonsynthetic peptides, peptoids or small molecules identified from libraries, cell broths, natural 
extracts, etc. The method of the assay is the same as above, except that it is performed in a cell free 



20 



wo 97/31113 



PCT/US97/02635 



1 0 



1 5 



20 



25 



30 



35 



40 



45 



I^\p''S*^'*'!f ^^u"'''''*'*** P*'^ inhibitors 

A Counterscreens 

compound to significantly alter the level of exoression of fh« « . 

an inhibitor expression of the reporter gene confirms its activity as 

A number of p,o,el„s ha.e ted TJTuZ.ISl^.^'Z^l':! '"""""^ '* ™ 
HowL,. ho^one SnZ tead"T*e ^^f^rj^ "L"^ 1"""='" 
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nzymes and transcription factors. Thus, with the addition of estrogen, detectable protein 
production w utd be curtailed, due to the dephosphorylation of the peptide ligand. Such cells could 
be used to screen for phosphatase inhibitors that block the reduction of reporter production that 
would occur when the cells are exposed to estrogen. 
5 In a further possible counterscreen. compounds may be tested in an assay as described 

herein, but replacing the PBD and peptide Ugand domains with SH3 and SH3 ligand domains as 
described in W095/2441 9. Compounds should not be active unless they inhibit a non PBP-related 
component of the assay system or also inhibit SH3-mediated protein-protein interactions. 

The inhibitors identified in the assay system of this invention can be further evaluated by 
1 0 conventional methods for possible therapeutic applications, toxicologicat and pharmacological 
activity. For example, test compounds so identified as inhibitors may further be evaluated for 
activity in inhibiting cellular or other biological events mediated by a pathway involving the 
PBD-based interaction of interest using a suitable cell-based assay or an animal model. Cell-based 
assays and animal models suitable for evaluating inhibitory activity of a test compound with 

1 5 respect to a wide variety of cellular and other biological events are known in the art. New assays 

and models are regularly developed and reported in the scientific literature. 

By way of nonlimiting example, compounds which bind to an SH2 domain involved in the 
transduction of a signal leading to asthma or allergic episodes may be evaluated in a mast cell or 
basophil degranulation assay. The inhibitory activity of a test compound identified as an SH2 

2 0 inhibitor by the method of this invention with respect to cellular release of specific mediators such 

as histamine, leukotrienes, hormonal mediators and/or cytokines, as well as its biological activity 
with respect to the levels of phosphatldylinositol hydrolysis or tyrosine phosphorylation can be 
characterized with conventional in vitro assays as an indication of biological activity. [See, e.g., 
Edward L. Barsumian et al. Eur. J. Immunol , n aiT-r^p.-^ (1981); 1^. J. Forrest. Biochem. 
25 Pharmacol,. 4£:1221-1228 (1991) (measuring N-acetyl-betaglucosaminadase from activated 
neutrophils): and V. M. Stephan et al.. J^fiicLCtlfillL. 2£Z:5434-5441 (1992)]. 

For example, histamine release can be measured by a radioimmunoassay using a kit available 
from AMAC Inc. (Westbrook. ME). One can thus evaluate the biological activity of inhibitors 
identified by the method of this invention and compare them to one another and to known active 

3 0 cornpounds or clinically relevant compounds which can be used as positive controls. 

Generally speaking, in such assays IC50 scores of 150-300 uM are considered of interest, 
scores of 50-150 uM are considered good, and scores below about SO uM are of high interest. 
Prior to in vivo models, inhibitors identified by this invention may also be tested in an ex vivo 
assay for their ability to block antigen-stimulated contraction of sensitized guinea pig tracheal 

3 5 strip tissue. Activity in this assay has been shown to be useful in predicting the efficacy of 

potential anti-asthma drugs. 

Numerous animal models of asthma have been developed and can be used (for reviews, see 
Larson, 'Experimental Models of Reversible Airway Obstruction", in THE LUNG, Scientific 
Foundations. Crystal, West et al. (eds.). Raven Press. New York, pp. 953-965 (1991); Wamer 
40 et al.. Am. Rev. Respir. Pis.. 141 .253-257 (1990)]. Species used in animal models of asthma 
include mice, rats, guinea pigs, rabbits, dogs, sheep and primates. Other in vivo models available 
are described In Cross et al.. Lab Invest. . 83:16P-I7n (1990); and Koh. et al., Science . 
25£.:1210-1213 (1992). 

By way of further example, compounds identified as PBP inhibitors by the method of this 

4 5 invention which bind to an PBD involved in the transduction of a signal involved in the initiation, 

maintenance or spread of cancerous growth may be evaluated in relevant conventional in vitro and 
in vivo assays. See e.g., Ishli et al., J. Antibioj, . 2iUl:1 877-1 878 (1989); and US Patent 
5,206,249 (issued 27 April 1993). 
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VI. Uses of Inhibitors Identified by This Invention 

n..JH^HT'^-"^T^*'T'^ "-^ '"^y '''o'^Sical reagents in assays as 

descnbed herein for «unct.onai classification of a PBD of a particular protein, particularty a newly 

K K . !f ^ specificity. iWloreover, inhibrtors identified by this invention can be used to 
PB? .nhLTno^rh L°' T''"' 'T' interactions med^ted by a 

b?oiog;rS'B?r^iJ^^^^^ ^^^^^^-^^ ^^^^^ ^'^^^ -^^^n^ th^ 

tr««tn,!m of'^'^f Hv °' ^^"^^ «''«'"P'«- *" diagnosis, prevention or 

treatment of conditions or diseases resulting from a cellular processes mediated by a PBP 

o..™rr T^""^^^- ^ "^"^"^ '^'^ '° P^«^°"' occurrence or progression of 

or hlZ^n" °' ? T?^ T ''^ administering ,o the patient in need thereofan SH2 binding 

or blocking agent which selectively binds Src SH2 u"iui"y 

usefuUh^LTrT «hich phosphopeptide binding or blocking agents may be 

PLSrarTaTn^rry^^^^^ '^""^ ^'^^^ ''^^ domain-containing proteins 

Src. PLCgamma and Grb7 have been implicated. Other relevant conditions include prostate cancer 
in which case targeting Grb2. PLCg. and PI3K. all of which contain SH2 domainfrSS be usefuT^ 

2 0 leukemia ?AML). '° myelogenous leukemia (CML) or acute myelogenous 

Still other relevant applications of an PBP inhibitor would be to prevent interferon- arowth 
factor-, or cytokine-mediated diseases (e.g. inflammatory diseases) by targeting the pISTof SW 
aSSon oT?ir ""tr f""""^ °' ZAP-yJ^which is believed'to be Lorved n 
Sone L b^m ^hTh "'f ^.o °' autoimmune diseases. An inhibitor that 

n !^?„» ♦ f 5 °' as an immunosuppressant to 

prevent rejection of skin and organ transplants. 

of Dha^rL^^rii? *^^ capacity to inhibit protein-protein interactions required for cellular events 

maS h«T.!H^ PBD/P«P"'1« "gand interaction inhibitors identified by the method 

Tneed tZtH! fT-'Kl"'"^' =°'"P°^"'°"^ '"^t^'^ds for treatment or prevention in a subject 

enTZ lTl A '° '^^"^ °^ P^^^^"» diseases or their pathological 

effects mediated by such interactions. 

nr^vpn^yf^r'"^' '^'"^^ Completely block one of the two ZAP SH2 domains should effectively 
prevent ZAP from associating with the activated TCR and thus block T cell activation A ZAP 
usl!?Z °' ^P^^'^'^aOy inhibit T cells and avoid the toxicity of the currently 

expfesTed o^^^^^^^^ drugs. FK506 and cyclosporin, which target the more ubiquitously ' 

tteruast .5rt»n ; ? f"""^ '^alcineurin is required for cellular activities in several 

S^rvous ,v.»^^^^^^ I f iy".'°^P°"" ^"'^ '^KSOe cause side effects in the kidney and central 
nervous system which limit the.r application largely to patients with organ transplant rejectTon. 

4 0 VII. Pharmaceutical Compositions and Methods 

A Compositions 

.nnJTnn^^V'^^"^^'^'^ formulated into pharmaceutical compositions 

4 5 ^TrZL ^'TT^'' prophylactically) effective amount of the inhibitor in adm xture 
^ceptab^ oroanic? ' ^'^^'P'^"'^ « • P^^armaceutically 

^^Tn^LTZZJ H^^"" '^^^"^^^"bstances suitable for parenteral administration) using 
SZ hoI ^ ^""^ ^"^^ ^ '"^'"'^^s but is not limited to saline buffered 

Si be sS^S'Thrfoi,?r"^''?r'^"^^ combinations thereof. The carrier and compo 
can be stenle. The formulation should suit the mode of administration. 



25 



30 



35 



23 



wo 97/31113 



PCTAJS97/02635 



The composition, if desir d, can also contain minor amounts of wetting or emulsifying agents, 
or pH buffering agents. Ttie composition can be a liquid solution, suspension, emulsion tablet pill 
capsule, sustained release fomriulation. or powder. Tfie composition can be fomiulated as a 
suppository, with traditional binders and carriers such as triglycerides. Oral formulation can 
5 include standard carriers such as pharmaceutical grades of mannitol. lactose, starch, magnesium 
stearate. sodium saccharine, cellulose, magnesium carbonate, etc. 

In a specific embodiment, the composition is formulated in accordance with routine 
procedures as a pharmaceutical composition adapted for intravenous administration to human 
beings. Typically, compositions for intravenous administration are solutions in sterile isotonic 
1 0 aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a local 
anesthetic to ease pain at the side of the injection. Generally, the ingredients are supplied either 
separately or mixed together in unit dosage form, for example, as a lyophilized powder or water 
free concentrate in a hermetically sealed container such as an ampoule or sachette indicating the 
quantity of active agent. Where the composition is to be administered by infusion, it can be 

1 5 dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where 

the composition is administered by injection, an ampoule of sterile water for injection or saline 
can be provided so that the ingredients may be mixed prior to administration. 

Topical compositions include a pharmacologically acceptable topical carrier, such as a gel, an 
ointment, a lotion, or a cream, which includes, without limitation, such carriers as water, 

2 0 glycerol, alcohol, propylene glycol, fatty alcohols, triglycerides, fatty acid esters, or mineral oils. 

Other topical carriers include liquid petroleum, isopropyl palmitate, polyethylene glycol, ethanol 
(95%). polyoxyethylene monolaurate (5%) in water, or sodium lauryl sulfate (5%) in water. 
Other materials such as anti-oxidants, humectants, viscosity stabilizers, and similar agents may 
be added as necessary. 

2 5 Iwtaterials and methods for producing the various formulations are well known in the art (see 

e.g. US Patent Nos. 5.182,293 and 4,837,311]. 

B. Methods 

The invention provides methods of treating, preventing and/or alleviating the symptoms 

3 0 and/or severity of a disease or disorder referred to above by administration to a subject of a PBP 

inhibitor in an amount effective therefor. The subject will be an animal, including but not limited 
to animals such as cows, pigs, chickens, etc., and is preferably a mammal, and most preferably 
human. By "mammals" is meant rodents such as mice, rats and guinea pigs as well as dogs, cats, 
horses, cattle, sheep, nonhuman primates and humans. Such effective amounts can be readily 

3 5 determined by evaluating the inhibitors identified by this invention in conventional assays 

well-known in the art. including assays described herein. 

Administration of such composition may be by any conventional route using appropriate 
formulations as are well known in this art. Various delivery systems are known and can be used to 
administer the inhibitor, e.g., encapsulation in liposomes, microparticles, microcapsules. One 

4 0 mode of delivery of interest is via pulmonary administration. Other methods of introduction include 

but are not limited to intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, 
intranasal, epidural, nasal and oral routes. The inhibitor may be administered by infusion or bolus 
injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and 
intestinal mucosa, efc.) and may be administered together with other biologically active agents. 
4 5 Administration can be systemic or local. For treatment or prophylaxis of nasal, bronchial or 

pulmonary conditions, preferred routes of administration are oral, nasal or via a bronchial aerosol 
or nebulizer. In specific embodiments, it may thus be desirable to administer the inhibitor locally 
to the area in need of treatment; this may be achieved by, for example, and not by way of limitation, 
local infusion during surgery, topical application, by injection, by means of a catheter, by means 
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Admmtettallon to an indivklyal of an iu^Tt^'^'!"'^"" """"""es. or liters. 

dose-response curves de^^f om t^^^^ ''^ extrapolated from 

effective dose of the inhibitor IslTe ^Ze otT^u^O^^^^^^^^^ ''^r'' ' '"^'"^^ 

range of about 1 to about 2000 mg J^r pL' ent °' *" " "^"^ 

the att^n^rph^sS^^^^ componer,t(s). should be determined by 

ir^cluding the PhosphopeptideLdlnrinrac^^^^^^^^^^^^ h 

and the age, body weight sex and oLe rhiLTn ^ »he route of administration, 

stage of th^e disea^se; ^ -nica, 



administration ^ ^ °' manufacture, use or sale for human 



Examples 

EXAMPLE 1: PLASMIDS 

40 A 



45 



kinase-Zeta ITAM in yeast This DlasmTri hlc IT ^ expression of GAL4 (1-147)-vSrc 

Amp (bacteria) or "L.rseS^^^ LoT Si ^Sr" '^'''T'' 
expression of the GAL4 DBD fusion proteL Downstream o^Ga ? J ^ sequences that direct the 
restriction endonuclease cloning sites "^^"stream of GAL4 coding sequences are multiple 

GAL4leq:e?ce?rpG/^^^^^^^^^ kinase domain was introduced downstream of 

Oligonucleotides, r^sle^l^'^^^:^^^^ The 5" and 3' 

.0 NO:1, and (Bam HI end, 9cGGATCCctcagcgac;:;c£~%|Q^T^^^^^ ^^^^ 
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residues 248-526 of vSrc. PGR generated vSrc kinase DNA was sequenced after cloning to check 
tfie fidelity of the amplification proc ss. 

Sequences encoding human Zeta chain cytoplasmic residues (Genbank Accession# J04132, 
residues 52-164), cloned into pET23b (Invitrogen) were excised using Nde 1 and Hind III. The 
5 restriction fragment encoding Zeta residues was blunt ended using T4 DNA polymerase and ligaled to 
blunt ended Sail digested pGTB9-vSrc Kinase DNA. The recombinant DNA was digested with 
restriction endonucteases to identify a clone which contained Zeta coding sequences in the proper 
orientation. 

1 0 B. pYAZ22, a GAL4 activation domain-tandem ZAP SH2 yeast expression vector. 

Plasmid pGAD424 (Clontech Laboratories) was used for the expression of GAL4 activation 
domain-tandem ZAP SH2 fusion protein in yeast. pGex2KT-huZAP{1-259). containing residues 
1-259 of human ZAP-70 (Genbank Accession M L05148) was digested with BamHI and blunt ended 
using T4 DNA polymerase. After the addition of EcoRI linkers (New England Biolabs, catalog 

1 5 #1020). nucleotide sequence encoding the tandem SH2 domains of ZAP (residues 1-259) was 

isolated as an EcoRI fragment and cloned into the EcoRI site of pGAD424. 

The recombinant DNA was digested with restriction endonucleases to identify a clone which 
contained ZAP SH2 domain coding sequences in the proper orientation. 

2 0 C pYerSZ, an estrogen regulated GAL4 (1-147)-vSrc kinase-Zeta ITAM yeast expression 

vector. 

Human estrogen receptor ligand binding domain (l.d.b.) coding sequences (residues 
282-595) were amplified by PGR from pHuER (Genbank Accession* Ml 2674) using the 5" and 3' 
oligonucleotide primers, respectively, (EcoRI end) cgGAATTCtctgctggagacatgagagct (SEQ ID NO:3] 

2 5 and (EcoRI end) cgGAATTCgactgtggcagggaaaccct (SEQ ID N0:4]. After digesting the amplified DNA 

using restriction endonuclease EcoRI, estrogen receptor l.d.b. sequences were cloned into EcoRI cut 
pYSZ (Example 1A). A recombinant clone was isolated that had estrogen receptor sequences in the 
right orientation. This new plasmid, pYerSZ, directs the expression of GAL4 DBD-estrogen 
receptor l.d.b.-vSrc kinase-Zeta ITAM fusion protein in yeast. 

30 

D. pMSZ, a GAL4 (1-147)-vSrc kinase-Zeta ITAM mammalian cell expression vector. 

pBXGI, a pEGE72-based vector (Sadowski I. and Ptashne, M., Nucl. Acids Res.. 17:7539 
(1989)], was used for the expression of GAL4 DBD fusion proteins in mammalian cells. pEGE72 
[Ellis, L.. et al. CM. 4^:721-732 (1986)] has the SV40 virus origin of replication, SV40 early 

3 5 promoter and SV40 polyadenylation regulatory sequences. GAL4 (1-147) and multiple cloning 

sites C-terminal to GAL4 coding sequences were obtained from pSKGAL147 [Kakidani, H. and 
Ptashne. M., CfiU, 52:161-167 (1988)] as a Hind Ill/Xbal fragment and inserted into Hind 
Ill/Xbal cut pECE72 to generate pBXG1. 

The sequences encoding vSrc kinase Zeta ITAM were excised from pYSZ (Example 1 A) as an 

4 0 EcoRI/Bglll fragment and cloned into EcoRI/BamHI digested pBXGI to generate plasmid pMSZ. 

capable of directing the expression of GAL4 (1-147)-vSrc kinase-Zeta ITAM in mammalian cells. 

£ pMAZ22, a Herpes Virus VP16 activation domain-tandem ZAP SH2 domain mammalian 
cell expression vector. 

4 5 pMVNI (Ivan Sadowski. University of British Colombia, Vancouver, GA) was used lor the 

production of transcription activation domain-tandem ZAP SH2 domain fusion proteins in 
mammalian cells. pMVN 1 contains the SV40 early promoter. HSV TK translational leader 
sequence. SV40 nuclear localization sequences and VP16 activation domain residues followed by 
multiple cloning sites for the construction of novel fusion proteins. The sequences encoding the 
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into EcoRI-cul pMSZ (Example 1D1 A ™Z,kS? . ' »ere donsd 

OBD-^l^^l^^t^T, '*) ="9l"«™d lor the proauetion „l GAL4 

. 5 S2, tsTwron^pSi' lTo C^aT S°"B™of TJT^r 

1^:1582-1590 (199511 usinn The I tZ Tt f ''- Mtl, Cell, Bihl . 

acgcGTCGACttataaatcaatgggaggag [SEQ ID NO -61 Th™f?» 
^ ^ BamH, and Sa... was cJSZ BaL^i^.r^gelJ^G^^^^^^^ ^"^^ 

e.preLn vZT ' <^-^'^>--s^rogen receptor Lt.d. -Src .ir^ase-Bete ,TAM yeast 

2 5 was isolated that had estroaen receofor^po ?J P^SB (Example 1G). A recombinant clone 

rirp-rfh-;:—"^"^^^^^^^^^^^^^ 

/• pYAS32, a GAL4 activation domain-'irr tw'^/cuo . 

3 0 PGAD424 rrinnoto^h I ""mam t,rc t>H3/SH2 yeast expression vector 

144-249) [see^anLka7Tnd F^^^ domain (residues 

5' and 3- oligonucleotides. respeSy ^ ^"""^^ ^'"P""^'' "='"9 «he 

fSEQ ID NO:91 and (BamHI end) 
ocactoGGATCCgccggggcacaoggtggtgaggc(SEQIDNO:10J toamni enaj 

Atter digestion with EcoRI and BamHI the Prn franr«o„. 
digested pGAD424. fragment was cloned into EcoRI/BamHI 
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K. pMSB, a GALA (1-147)-vSrc kinase-Beta ITAM chain mammalian cell expression 
vector. 

pYSB (Example 1G) was cut with restriction endonucleases EcoRI and Bglll and the DNA 
5 fragment encoding vSrc kinase-Beta ITAM chain residues was isolated and ligated to EcoRI/BamHI 
digested pBXGI (Example 1G). 

L pMerSB, a GALA (1-147)-estrogen receptor l.b.d. -vSrc kinase-Beta ITAM chain 
mammalian cell expression vector. 
1 0 pMSB (Example IK) was cut with restriction endonuclease EcoRI. An EcoRI DNA fragment 

containing the human estrogen receptor l.b.d. coding sequences, prepared as described in 1C above, 
was ligated to EcoRI digested pMSB. A recombinant clone was isolated that had estrogen receptor 
sequences in the right orientation. This new plasmid, pMerSB, directs the expression of GAL4 
DBD-estrogen receptor l.b.d. vSrc kinase-Beta ITAM fusion protein in mammalian cells. 

1 5 

M. pMAS32. a Herpes Virus VP16 activation domain-human Src SH3/SH2 domain fusion 
protein mammalian cell expression vector. 

pMVN 1 (Ivan Sadowski, Univ. B.C., Vancouver, CA) was used for the production of 
TAD-human Src SH3/SH2 domain fusion proteins in mammalian cells. Sequences encoding the Src 
20 SH3/SH2 domain (residues 84-249; see above) were amplified by PGR using the 5' and 3' 
oligonucleotides, respectively: 

(EcoRI end) cctcacGAATTCggtggagtgaccacctttgtggcc (SEQ ID NO:1 1] and (BamHI end) 
ccactcGGATCCgccggggcacacggtggtgaggc (SEQ ID NO:12]. 

After restriction endonuclease digestion using EcoRI and BamHI, the PGR product was ligated 

2 5 to EcoRI/BamHI digested pMVN 1 . 

N. pMAS2, a Herpes Virus VP16 activation domain-human Src SH2 domain fusion protein 
mammalian cell expression vector. 

Sequences encoding the Src SH2 domain (residues 144-249) [Tanaka et al, supra] were 

3 0 amplified by PGR using the 5' and 3' oligonucleotides, respectively: 

(EcoR I end) cctcacG/VATTCggcgactccatccaggctgaggag (SEQ ID NO: 13] and (BamHI end) 
ccactcGGATGCgccggggcacacggtggtgaggc (SEQ ID NO:14]. 

Alter digestion with EcoRI and BamHI, the PGR fragment was cloned into EcoRI/BamHI 
digested pMVNI (Example 1M). 

35 

Q pMerVP, a GALA DBD-estrogen receptor l.b.d.-VP 16 TAD fusion protein mammalian 
cell expression vector. 

An EcoRI DNA fragment containing the human estrogen receptor l.b.d. coding sequences 
prepared as described in 1C above was ligated to EcoRI digested pBXGI (Example 1G). Glones 

4 0 containing the estrogen receptor sequences in the right orientation (designated pMer) were 

identified by restriction endonucleases analysis. VP16 TAD residues were amplified using template 
plasmid pMVNI (Example 1M) and the 5' and 3' oligonucleotides, respectively: (BamHI end) 
cgGGATCGagcctgggggacgagctc (SEQ ID NO:151 and (Spel end) ggAGTAGTcccaccgtactcgtcaat (SEQ ID 
NO:161. After digestion with BamHI and Spel, the PGR fragment was cloned into BamHI/Spel 
4 5 digested pMer. 

P. pMAF2, a Herpes Virus VP16 activation domain-human Fyn SH2 domain fusion protein 
mammalian cell expression vector. 
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amniifLZ Prp i r ^ ""^^-^S^) fG«nbank M14333] were 

amplified by PGR using the 5' and 3" oligonucleotides such that the Fyn SH2 domain coding 
sequences contained EcoRI and BamHI ends. After digestion with EcoRI and BamHI the PGR 
fragment was cloned into EcoRI/BamHI digested pMVNI (Example 1M) 



EXAMPLE 2: STABLE MAMMALIAN CELL LINE 

Pvn^i "^'7 °' "^^'^ established by introducing selected plasmid DNA. described in 

Example 1 into mammalian cells using lipofectin reagent (Gibco BRL) as suggested by the 
manufacturer. ^ ^ 

n u ^^'r'f'* ""^ mammalian HT 1080 fibrosarcoma ceil line (American Type 

Culture Collecfon Accession No. CRL7951]. These cells are prepared by lipofectiin using a SEAR 
reporter construct comprising 

nrnmolor ,H ^.^'-'^.^^P^^^'ve/dependent promoter (5 GAL4 binding sites linked to the core IL-2 
promoter, the G5-IL2" promoter), which is placed upstream of 

{ b ) secreted alkaline phosphatase (SEAP) coding sequences; and 
a plasmid containing the hygromycin gene. 

This cell line G5.IL2 HT1080 was employed in a variety of the examples below. 

EXAMPLE 3: TWO-HYBRID MECHANISM-BASED IN VIVO ASSAY - ZETA 
Ce1.'S'^''°^" ^'^^ SH2-DEPENDENT TWO-HYBRID FORMATION IN MAMMALIAN 

rr»JiV yP''^ ^''^"If'^ ^^^^y °' '^'^ invention, a two hybrid interaction recapitulates a 
ml rinT "^"""^ "^"lely the SH2 domain-dependent binding of 

the 2AP-70 tyrosine kinase to the TCR zeta ITAM chain of the T-cell receptor 

. L .^^.'"-^ »TW80 fibrosarcoma cell line of Example 2 is transfected with two effector 
plasmids by lipofection: 

r^Ai JnLr^ <^^^'"P'^ the expression of a novel fusion protein that consists of 

S^i t .T' "^'^ ^"'^ ^"f"^" ^'"^AM chain. This fusion protein should 

0 C^Vl rl ^ T Tu 'T "'"^'^ ^^'^'^y- °' ^^^^^ molecules, localized at the 

3 0 G5-IL2 promoter, should help promote eHicient zITAM phosphorylation. 

2 ) PMAZ22 (Example IE) drives the expression of the tandem SH2 domains of ZAP fused 
to the herpes virus VP16 TAD. 

«oi ^""^ comparisons. HT1080 cells were also transfected with carrier DNA 

only, or pMSZ only, or pMS (GAL4 DBD-vSrc kinase), or pMS and pMAZ22 
3 5 The transfected host cells are cultured in 6 well dishes. Forty-eight hours after 

ransfection the amount of SEAP released into the media was detemiined using a fluorescence assay 

LsSon °' (*-^- the effect on ' 

transcription and the amount of ZAP SH2-Z ITAM interaction) is ascertained by measuring the 
amount {,.e.. the activity) of the SEAP reporter gene released into the media 
r«lea J Ipap f". "Ik*"' ^'.^L^.";! '^""^^ ^ells transfected with carrier DNA do not 

J!! •^^r*^ ^' ^^l-'* °^°-^Src kinase-Zeta ITAM fusion protein. 

When expressed in cells, is not a competent transcriptional activator (FIG. 2, col 2) However 

;:fduced dl ''r^iff ^^^^ '^^^^ - trariscnption is' 

45 mer?F?G i T ^""^ ""^^'^ °^ SEAP into the culture 

J^infti T?"^" necessary for the induction of SEAP production in 

^d rtandl^'Src^So H " '"t«^a«ion of tyrosine phosphorylated Zeta ITAM 

^otls Snhff H K °' ^^f' P^°^"^'°" 'f^^t both of the fusion 

proteins descnbed above be expressed. The recognition and binding of tyrosine phosphorylated 
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zITAM by the ZAP SH2 domains effectively bring the VP16 TAD to the promoter, thereby inducing 
transcription of the SEAR reporter gene. 

Two hybrid formation is Zeta ITAM chain-dependent (compare FIG. 2. cols. 3 and 5) 
Furthermore. GAL4 DBD fusion proteins which lack the vSrc kinase domain (GAL4 DBD-Zeta 
5 ITAM). or which lack the Zeta chain, i.e.. pMS (GAL4 DBD-vSrc kinase), fall to induce SEAR 
production when produced in cells alone or with pH/IAZ22, i.e.. the VR16 TAD-tandem ZAR SH2 
fusion protein (see FIG. 2. cols. 4 and 5). See. also, cols. 1. and 2 for controls. In addition to the 
results shown in FIG. 2, cells that contain plasmids that encode GAL4 DBD and VR16 TAD-tandem 
ZAP SH2, or GAL4 DBD-vSrc kinase-Zeta ITAM and VP16 TAD, fail to produce SEAR 
1 0 Hence, in a heterologous cell line the tandem ZAR-SH2:hlTAM interaction that normally 

occurs in T-cells has been reproduced and a simple, sensitive means to monitor the extent to which 
these two molecules interact has been presented by this invention. 

Using SEAR production as an indicator of the interaction of these two molecules, it is possible 
to Identify compounds that can specifically inhibit ZAP SH2 domain-dependent interactions When 

1 5 the host cells are cultured, they are exposed to test compounds. The extent to which the test 

compounds successfully interrupt the interaction between the ZAP SH2 and the zITAM indicates the 
presence of an SH2 inhibitor. 

With the assay set up as described, one potential drawback when screening for molecules that 
inhibit ZAP SH2-dependent interactions is that the constitutive expression of the interacting 

2 0 components of the two-hybrid screen would result in constitutive ZAP SH2-Zeta ITAM complex 

formation. Hence, this embodiment of an assay identifies only compounds that efficiently disrupt 
two-hybrid formation. 

EXAMPLE 4: TRANSCRIPTIONALLY INDUCED TWO-HYBRID FORMATION- AN 

2 5 ALTERNATIVE TWO-HYBRID ASSAY FOR MAMMALIAN CELLS SCREEN FOR 

COMPOUNDS THAT INHIBIT ZAP SH2 DOMAIN FUNCTION 

In another embodiment of the assay of this invention, cells are exposed to potential inhibitory 
test compounds and then the interaction of proteins whose association is to be inhibited is induced. 
This embodiment of the assay depends on use of plasmids tor the production of both the Zeta ITAM 

3 0 and tandem ZAP SH2 fusion proteins in the cells (as described in Example 2). which have inducible 

transcriptional regulatory sequences, rather than constitutive (i.e.. SV40 virus based) 
transcriptional regulatory sequences. 

An example of an inducible transcriptional regulatory sequence includes the mouse mammary 
tumor virus (MMTV) dexamethasone inducible promoter. Other known regulatory sequences are 

3 5 useful here. Thus, if, for example, the sequences encoding GAL4 DBD-vSrc kinase-Zeta ITAM 

fusion protein were linked to the MMTV inducible regulatory sequences, fusion protein production 
would be dexamethasone-dependent. The reporter cell line is then exposed to potential inhibitory 
compounds and two-hybrid formation is induced using dexamethasone. In this embodiment Zeta 
ITAM-tandem ZAP SH2 two-hybrid formation occurs if SH2 domain(s) binding is not affected by 

4 0 inhibitory compounds. 

The mammalian G5-IL2 HT 1080 fibrosarcoma cell line (Example 2) is transfected with two 
effector plasmids by lipofection: 

( 1 ) One plasmid constitutively drives the expression of TAD - ZAP tandem SH2. 

( 2 ) The other plasmid encodes a fusion protein GAL4DBD-vSRC kinase domain - z ITAM 
4 5 expressed using the dexamethasone (Dex) inducible MMTV promoter. 

These transfected mammalian cells are seeded in 96 well dishes and serial dilutions of test 
compounds are added to the wells. Dex is added to the media. Phosphorylated z ITAM fusion protein 
accumulates in cells upon the addition of Dex. After a 24 hour incubation, media is removed from 
the 96 well dish and placed into a second 96 well dish to perform a chromogenic or fluorescence 
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By binding of the fusion proteTto the ikp S2 Si ' ^ ^".^fr °' ^^say. 
promoter and SEAP is produced. CompT^^^^ '^V'^^ "^"9*" SEAP 

5 before dex-induced production of phoTpS^d h ,tZ'^^^^ r *° *° '^^'"^'"^ 
function reduce SEAP production c3ouT£ 1^ f^'^' ^o'^PO^'^ds that block ZAP SH2 

These transfected mammalian cells are seeded in qr ^» ^^T'' e 
transfection, the amount of SEAP released Jo th! ml ^" ''°«y-e*9ht hours atter 

The results are shownirp G 6 col 1 i^^L f T ^^'^"^'''^^ "=*"9 a fluorescence assay, 
receptor Beta ITAM fusiorp;otein) is not b setfcomo'lten^^^^ (GAL4 DBD-vSrc kInase-lgE 
However, when pH4AS2 {VP16-Src SHp „.L1 . ? transcnptional activator, 

induced (see col.'s). fn ItrLfvP 'e'an zTsU^TsnTT ^^^"^"'"^'^^ 
"'^tj^n '™ P-teir,tTnd?c: ^EAP^r due ^^^^^^^^^^ 

fibrosa?orSoTcX7s1^^^^^^^ DBD-vSrc Jinase-ZetiTT^M Inl HT1080 

residues are replied wtth Z lela ITAM^« '°" ^'^^'^ " ''^♦^ 'TAM 

not observed uJ^ng These conditfons 9 ^^.^^^ - 

domai^spe"c!;;f^etnder^^^^ "^"t'':' ^P^^- '^AM and SH2 

Beta fTAM (FIG. Tco 13) T^^ Src SH2 T^^^^^ ^''^ '^^"^ ("^'^S- 2 and 3). but not to the 

to Zeta .TAfi residues^^'i obse^tLtn ^^^^^^ If ™f 6) but not 

s:?a:^dSs;=r~^ 

40 — /.n^^^^aSi;^^^^^^^ '^^^-^^ --^^^^^ '-ar^n /n 

effecto?;,aTmTdr2rnS>Son"' '''' <^--P'« 2) is transfected with two 

.TAM ;uUnTr:;ein In m^lSmali^n'S °' ^^'"^ "^^^'^^ ' ''■'^-S- ^'-e-Zeta 

45 2) PMAZ22.VP16TAD.tandem ZAP SH2 (Example IE) 

For controls and comparisons, the G5IL -2 HTinBn^.»ii> 
DNA. or With pMSZ only, or with pMSZ and S^A^lT^rh ^ J,^^,^^^ carrier 
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Each of these transfected mammalian cells ar seeded in 6 well dishes and serial dilutions of 
test compounds are added to the wells. Tw nty-four hours after transfection, one duplicat dish of 
cells was cultured in the presence of 10 nM estrogen. 

Forty-eight hours after transfection, an aliquot of media from all of the dishes was removed 
5 from the 6 well dish and placed into a 96 well dish to pertomi a chromogenic or fluorescence assay 
for SEAP activity. 

FIG. 3 provides the results of this assay: The cell line transfected with carrier DNA is 
reported in cols. 1 and 2; or with pMSZ (Example ID) in cols. 3 and 4; or with both pMSZ and 
p(WlAZ22 in cols. 5 and 6. Cols. 7 and 8 are the results of the cell line transfected only with plasmid 
1 0 pf^erSZ. Cols. 9 and 10 are results of the cell line transfected with both pMerSZ and plulAZ22. as 
described above. 

The addition of the estrogen receptor l.b.d. residues to the GAL4 DBD-vSrc kinase-Zeta ITAIWI 
fusion protein makes two-hybrid formation estrogen-dependent (compare FIG. 3. cols. 9 - 10). 
That is, in the absence of exogenously added estrogen, the fusion protein {GAL4-DBD-ER l.b.d.- 

1 5 vSrc kinase-g-ITAM) is not functional. In fact, FIG. 3, col. 9 shows that only low levels of SEAP 

are detected in the media. This is presumably due to the low levels of estrogen present in serum. 
Low levels of SEAP are not observed if the cells are cultured in a defined synthetic media lacking 
estrogen. When plasmids pMerSZ and pMA222 are expressed together in cells, two-hybrid 
formation (reporter gene transcription) should not occur (col. 9), unless estrogen is present in 

2 0 the media (col. 10). When these transfected cells were exposed to 10 nM estrogen, copious 

amounts of SEAP was released into the media. 

Estrogen did not markedly affect two-hybrid formation if the estrogen receptor l.b.d. was not 
present in either plasmid expressing fusion protein (cols. 5 and 6). Also, the estrogen receptor 
l.b.d. residues do not make the GAL4 DBD-vSrc kinase-Zeta ITAM fusion protein itself a 

2 5 transcriptional activator (FIG. 3, Cols. 7 and 8). 

SEAP induction was time-dependent, the majority of the SEAP was produced 12-18 hours 
after exposure of the cells to estrogen (see FIG. 5 for analogous results with stably transfected 
cells). 

3 0 a Estrogen regulation of Beta ITAM-src SH2-dependent two-hybrid formation in 

mammalian cells (transient transfection). 

In a manner similar to that described above for the Zeta ITAM constructs, G5-IL2 HT 1080 
cells were transfected with the following plasmid DNA (in duplicate): pMerSB or pMerSB and 
pMAS2. Twenty-four hours after transfection. one dish of cells was cultured in the presence of 10 

3 5 nM estrogen. Another twenty-four hours later, an aliquot of media from all of the dishes was 

analyzed for SEAP activity. 

As shown in FIG. 7, the presence (+) or absence (-) of estrogen is indicated, and the columns 
represent cells transfected with pMerSB (Example 1L) (Cols. 1 and 2] or with pMerSB and pMAS2 
(Example IN) [Cols. 3 and 4). The addition of the estrogen receptor l.b.d. residues to the GAL4 

4 0 DBD-vSrc kinase-Beta ITAM fusion protein makes two-hybrid formation estrogen-dependent 

(compare cols. 3 and 4). In contrast, the estrogen receptor l.b.d. residues do not make the GAL4 
DBD-vSrc kinase-Beta ITAM fusion protein itself a transcriptional activator (cols. 1 and 2). 

C Stable cell line that constitutively expresses GAL4 DBD-estrogen receptor l.b.d.-Src 
4 5 kinase-g-ITAM and ZAP-SH2 activation domains in mammalian cells. 

G5IL-2 HT1080 cells that contain an integrated GAL4-dependent SEAP reporter gene 
(Example 2) were transfected with pBabeNeo (neomycin selection). pMerVP (Example 10). and 
the two effector plasmids that drive the constitutive production of GAL4 DBD-estrogen receptor 
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l.b.d.-vSrc kinase-Zeta ITAM and VP 16 TAD-tandem ZAP qw9 i o nM«.,c7 . 
PMA222 (Example ,E,. ,«pe*e„. „„e J^ZT^,!'^ ^.stm'' "Z" ^ 

presence of 10 oM estrogen and. al vanous «wS^mT,.ST^ , ' ' 
assayed ,o, the prasence'of SEAP a«M7 As sho^t FIG 1 rbl'S ="l.ore media wa. 

D. Conrro/ Ce// L/ne 

^^^JZT^^L^T'^" """S •"'™'" co-PO"nds tl,at a« 

gIu DBD^Xen^L?^^^^^^^^^ » -«"'vel, egresses 

I ...d..:?%T.t j-jrafa^es::: ~ 

t':°- c°ell^~= "~ asVel°;r:r S IrSeTnen^'" 

0 1 • Competitive Binding Assays- 
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(HBS) composed of 10 mM HEPES (pH 7.4) / 150 mM NaCI / 3.4 mM EDTA / 0.05% Surfactant 
P20 at 25°C and a flow rate of 5 uL min-1 . 
a Specifics of Tandem Syk Assay 

A pp72syk peptide ligand corresponding to the g-chain ITAM of human FceRI 
5 (DGVY(P04)TGLSTRNQETY(P04)ETLK (SEQ ID N0:17]] was synthesized as part of a larger peptide 
(Ac-CGGDGVY(P04)TGLSTRNQETY-(P04)ETLK-NH2 (SEQ ID NO:18)l and used to generate a 
Syk-sensitive biosensor surface. Specifically, a Biosensor Chip CMS was activated with 200 mM 
ethyl-3-(3-dimethylaminopropyl)-carbodlimide hydrochloride (EDC)/ 50 mM 
n-hydroxysuccinimide (NHS) to generate a surface reactive to primary amines; treated with 

1 0 ethylenediamine to generate a surface rich in primary amines; activated with 

m-maleimidobenzoyl-n-hydroxysuccinimide ester (sulfo-MBS; 50 mM in 25 mM NaHC03) to 
generate a surface reactive to free thiols; and the ITAM peptide immobilized through the 
n-terminal cysteine. Unreacted sites were blocked with b-mercaptoethanol and the chip cleaned of 
noncovalently bound peptide using 6 M guanidine hydrochloride. Assays were run in HBS using 20 
15 nM pp72syk(1-265) W- lest inhibitor. 

b. Specifics of C-Syk Assay 

A pp72syk peptide ligand corresponding to a hemiphosphorylated g-chain ITAM of human 
FceRI (DGVY(P04)TGLSTRNQETYETLK [SEQ ID NO:19]J was synthesized as part of a larger peptide 
[Ac-CGGDGVY(P04)TGLSTRNQETYETLK-NH2 [SEQ ID NO:20]1 and used to generate a 

2 0 C-Syk-sensitive biosensor surface as described above for tandem syk. Assays were run in HBS 

using 270 nM pp72syk{1 63-265) +/- test inhibitor. 

c. Specifics of Tandem ZAP Assay 

A pp70ZAP peptide ligand corresponding to the z-chain lTAM-1 of the human T-cell receptor 
[NQLY(P04)NELNIGRREEY(P04)DVLD [SEQ ID NO:21]] was synthesized as a part of a larger 

2 5 peptide lAc-KGGNQLY(P04)NEUMIGRREEY-(P04)DVLD-NH2 [SEQ ID NO:22] and used to generate a 

ZAP-sensitive biosensor surface. Specifically, a Biosensor Chip CM5 was activated with 200 mM 
EDC / 50 mM NHS to generate a surface reactive to primary amines and the ITAM peptide 
immobilized through the n-terminal lysine. Unreacted sites were blocked with ethanolamine (1 M 
in water) and the chip cleaned of noncovalently bound peptide using 6 M guanidine hydrochloride. 

3 0 Assays were run in HBS using 10 nM pp70ZAP(1-259) +/- test inhibitor. 

d. Specifics of Src Assay 

A p60src peptide ligand corresponding to the hamster Middle-T antigen (QY(P04)EEIPI [SEC 
ID NO:23]) was synthesized as a part of a larger peptide (Ac-KGGQY(P04)EE1PI-NH2 [SEQ ID 
NO:24]) and used to generate a src-sensitive biosensor surface as described above for tandem ZAP. 

3 5 Assays were run in HBS using 270 nM pp60src( 144-251) +1- test inhibitor. 

EXAMPLE 8: YEAST-BASED TWO-HYBRID SCREENING ASSAYS 

The constitutive as well as inducible Src SH2- and ZAP SH2 dependent two-hybrid systems 
described here using mammalian cells are also useful with yeast cells. A yeast strain which 

4 0 contains an integrated reporter gene consisting of the B-galactosidase gene linked to a promoter 

that is GAL4-dependent has been demonstrated. Two-hybrid formation induces B-gatactosidase 
production, which can be detected using an enzyme assay, or from the blue color produced when 
cells producing enzyme are exposed to X-GAL. As with mammalian cells, SH2-dependent 
two-hybrid formation in yeast can be rendered estrogen-dependent using the estrogen receptor 
4 5 ligand binding domain. 

A yeast strain such as the one that contains a mutation in the srbl gene is used for the 
two-hybrid assay. Cells containing the tsl allele of srtjl exhibit increased endocytosis and have 
generally increased permeability. Cells are engineered so that plasmids used for the two-hybrid 
screen can be selectively propagated. 
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a) HIS3-1,2,4 triazole Assay 

In this assay, cells auxotrophic for histidin are transformed with a HIS3 qene controlled bv 
TTTL ^^l'^^"*^'"^ ^'"'^ DNA binding sites. The tandem SH2 domains ofzAP a^e exprS^^^ 
cells linked to the LexA DBD. This fusion protein binds to DNA. but is not a competent 
5 transcnpfonal activator. The TCR z ITAM sequence is also expressed in the S as oart of a 
hybnd protein that includes a nuclear localization sequence. L Lck tyrosine k na^ camMic 

nZ:^s'wl^h''"T.T/ "^^^ "^'-^^ domam shou.d be ac«ve Ind n the 

ortTraT ' P^osphorylate the tyrosine residues of z ITAM sequence erther in c/s 

' ° proteiJ witrthe SHfdomSr'f ' ' •.^^'^-t^anscriptional activation domain fusion 

^In nT«r -f^r ! domains-LexA protein activates HISS transcription, generating cells that 
'".r^'^ k""?^ Compounds that interfere with the interaction of Vf^e z^P SH2 

eTcZJl!,';^^ ' '■"''^ ^^''"^ °' transcription. Compounds that 

efficiently block this interaction prevent yeast growth in media lacking histidine However 

bXTe?mustr? '"^11^" detecte^Ls hisS production 

by the cell must be dramatically reduced to affect viability Hiouuunon 

ZAP JJp zTtam 1°' "e adapted to be able to detect molecules that weakly block 

There are two counterscreens useful to evaluate whether compounds that score in the above 
assay exert their effect on growth by specifically affecting ZAP SH2 z ITAM assodSion fn one 

P™^^"'^^ °' ' i-tidine. 2^ P SH2 inhSo s do not affect 
the growth of cells under this condition. In a second counterscreen. an identiS S 
oeSli^if^V'T 'I"* the transcription of H1S3 is driven by a monomer of the exA Db£ 

cells) tn^meS'^,;!l!VrV'°.'i '''"' ^^^^ growth are seeded in 96 well dishes (0.1 mis of 
cells) in media that lacks histidine and contains 3-amino-1.2.4-tria2ole Serial dilutions of 

?^TaZ? T 1° '^"""^"^ days at 30oC on rsha^e "atfo m 

The attect of compounds on growth is assessed using a 96 well plate reader 

This assay depends on growth inhibition (the interference of two-hybrid dependent H1S3 

STuS^^^ ^''^""''^ to coSir^S the 

compounds that scored positive are not toxic to yeast 

b ) URA3/5-FOA Assay: 

inhih-Za^^^"^'^ approach for identifying molecules that inhibit ZAP SH2 binding in yeast scores 

Lly trt::dT^plH^^^ '° ♦^^-^^"^'^ ^^^-^-^ ceVde'ar .n ^s 

assay, ine tandem ZAP SH2 domain is expressed in yeast fused to the LexA DBD and thP r itam i r-u 

S^s'at:: thf *° r constructions described above Vol^^^^^^ 
this assay, the reporter gene, which is regulated by the LexA DBD is the URA3 gene 

K Fn?f TK * ^"^^ 9'"" P''''^"*^' exposed to 5-fluoro-orotic acid 
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reduce the URA3 gene product levels (and hence the fluorodeoxyuridine levels) below the level 
wh re toxic amounts of fluorodeoxuridlne levels are formed. 

In contrast to the assay of 8A above, toxic compounds are not scored when using the 
"URA3/5-FOA" screen, which is a cell viability assay. 

5 

EXAMPLE 9: ASSOCIATION OF SRC PROTEIN WITH TYROSINE PHOSPHORYLATED 
PROTEINS IN VIVO 

Several proteins have been reported to coprecipitate with v-Src from extracts of 
v-Src-transformed cells [Kanner, S.B. et al.. EMBO J. . 10:1689-1698 (1991); Koch. C.A.. MaL 
1 0 Cell. BlQl..ia:1 366-1 374 (1992)]. The ability of a high affinity Src SH2-binding peptide 

(EPQpYEEIPI. denoted here as VEEI [SEQ ID NO:25]) to block SH2 domain-dependent interactions is 
demonstrated in this assay. 

In this assay, Balb/c 3T3 and SRD-3T3 cells (V-Src transformed 3T3 cells) were lysed in 
RlPA buffer and the Src protein was immunoprecipitated with a monoclonal antibody to the 

1 5 amino-terminus of Src in the presence or absence of the SH2 and SH3 binding peptides. The 

samples were transferred to nitrocellulose, and the blot was probed with a monoclonal antibody to 
phosphotyrosine. Two major tyrosine phosphoryiated proteins co-migrated with v-Src from SRD 
3T3 cells, one of which co-migrated with the 62 kDa tyrosine-phosphorylated protein that bound 
to the Src SH3 domain, and the other protein of Mr 130,000, which is most likely the 130,000 

2 0 kDa protein previously detected in v-Src immunoprecipitates [Koch, C.A.. Mol. Cell. Biol. . 

12.:1366-1374 (1992)). 

The binding of Src to the 62 kDa protein was reduced significantly by incubation of the 
extracts with the "Src-pro" peptide or the phosphoryiated VEEI (SEQ ID NO:251 peptide, but not the 
poly-proline nor the unphosphorylated VEEI [SEQ ID NO:251 peptides, suggesting that both the SH2 

2 5 and SH3 domains of Src are required for stable association of this protein with Src. This protein 

has not been determined to be the p62 protein identified in the total cell SH3 binding proteins since 
the p62 antibody is not sensitive enough to recognize the small amounts of the 62 kDa protein that 
associate with Src under these conditions. The "Src-pro" peptide and the pVEEl [SEQ ID NO:25] 
peptide could only block 50% of pi 30 binding to Src. whereas 90% inhibition of pi 30 binding was 

3 0 observed when both peptides were used, suggesting that either the SH2 or the SH3 domain is 

sufficient for some degree of p130 association with Src. 

This invention is not to be limited in scope by the specific embodiments described herein. 
Indeed, various modifications of the invention in addition to those described herein will become 
£ipparent to those skilled in the art from the foregoing description. Such modifications are intended 
3 5 to fall within the scope of the appended daims. 

The disclosures of the patents, patent applications and publications cited herein are 
incorporated by reference in their entireties. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

5 (i) APPLICANT: ARIAD PHARMACEUTICALS INC 
RICKLES. Richard J. 

(ii) TITLE OF INVENTION: CELL-BASED ASSAY 

1 0 (iii) NUMBER OF SEQUENCES: 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ARIAD Pharmaceuticals, Inc. 

(B) STREET: 26 Landsdowne Street 

1 5 (C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02139-4234 

2 0 (V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: PatentIn Release #1 .0, Version #1 .30 

2 5 

(vi) CURRENT APPUCATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(C) CLASSIFICATION: 

30 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/012 218 
(B) FILING DATE: 23-FEB-1996 

3 5 (viil) ATTORNEY/AGENT INFORMATION: 

(A) NAME: BERSTEIN, David L. 

(B) REGISTRATION NUMBER: 31.235 

(C) REFERENCE/DOCKET NUMBER: ARIAD 336-PCT 

4 0 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-494-0400 

(B) TELEFAX: 617-494-0208 

(2) INFORMATION FOR SEQ ID NO V 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 : 

CGGAATTCTC CAAGCCCCAG ACCCA 25 
(2) INFORMATION FOR SEQ ID NO:2: 

1 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

1 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

20 

GCGGATCCCT CAGCGACCTC CAACA 25 
(2) INFORMATION FOR SEQ ID NO:3: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

3 5 CGGAATTCTC TGCTGGAGACATGAGAGCT 29 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 5 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
CGGAATTCGACTGTGGCAGGGAAACCCT 28 
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(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
GCGGATCCGG AGCTGGGGAA GAACTCA 27 

1 5 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

2 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
ACGCGTCGACTTATAAATCAATGGGAGGAG 30 

3 0 (2) INFORMATION FOR SEQ ID NO:7: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

3 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

CCTCACGAAT TCGGTGGAGT GACCACCTTT GTGGCC 
(2) INFORMATION FOR SEQ ID NO:8: 

45 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CCACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 
(2) INFORMATION FOR SEQ ID N0:9: 

1 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

1 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9' 

20 

CCTCACGAAT TCGGCGACTC CATCCAGGCT GAGGAG 
(2) INFORMATION FOR SEQ ID NO:10: 

2 5 (0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;10: 

3 5 CCACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 

(2) INFORMATION FOR SEQ ID NO:1 1 : 

(0 SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 5 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 1 : 
CCTCACGAAT TCGGTGGAGT GACCACCTTT GTGGCC 
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(2) INFORMATION FOR SEQ ID NO: 12; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
CCACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

CCTCACGAAT TCGGCGACTC CATCCAGGCT GAGGAG 
0 (2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nudeic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

CCACTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

CGGGATCCAGCCTGGGGGACGAGCTC 26 
(2) INFORMATION FOR SEQ ID N0:16: 

1 0 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

1 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

20 

GGACTAGTCC CACCGTACTC GTCAAT 26 
(2) INFORMATION FOR SEQ ID NO: 17: 

2 5 (i) SEQUENCE CHAFIACTERISTICS: 

(A) LENGTH: 1 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

3 5 (B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

4 0 (B) LOCATION: 15 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

4 5 Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gin Glu Thr Tyr Glu 
15 10 15 

Thr Leu Lys 
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(2) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

1 5 (D) OTHER INFORMATION: /note= "N-acetylated° 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 7 

2 0 (D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 18 

2 5 (D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 22 

3 0 (D) OTHER INFORMATION: /note= "amidated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Cys Gly Gly Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gin Glu 
35 1 5 10 15 

Thr Tyr Glu Thr Leu Lys 
20 

4 0 (2) INFORMATION FOR SEQ ID NO:1 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
4 5 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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1 0 



1 5 



40 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphorylated" 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 

Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gin Glu Thr Tyr Glu 
15 10 15 

Thr Leu Lys 



(2) INFORMATION FOR SEQ ID NO:20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
2 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

2 5 (A) NAME/KEY: Modified-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "acetylated" 

(ix) FEATURE: 

3 0 (A) NAME/KEY: Modified-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(Ix) FEATURE: 
3 5 (A) NAME/KEY: Modified-site 

(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "amidated" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Cys Gly Gly Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gin Glu 
15 10 15 



Thr Tyr Glu Thr Leu Lys 
4 5 20 

(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphoryiated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 15 

(D) OTHER INFORMATION: /note= "phosphoryiated" 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

Asn Gin Leu Tyr Asn Glu Leu Asn lie Gly Arg Arg Glu Glu Tyr Asp 
15 10 15 

Val Leu Asp 



(2) INFORMATION FOR SEQ ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "N-acetylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "phosphoryiated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 18 

(D) OTHER INFORMATION: /note= "phosphoryiated" 
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(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 22 
5 (D) OTHER INFORMATION: /note= "amidated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Lys Gly Gly Asn Gin Leu Tyr Asn Glu Leu Asn lie Gly Arg Arg Glu 
10 1 5 10 15 

Glu Tyr Asp Val Leu Asp 
20 

1 5 (2) INFORMATION FOR SEQ ID NO:23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

2 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

2 5 (ix) FEATURE; 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "phosphorylated" 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Gin Tyr Glu Glu He Pro He 
1 5 

3 5 (2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

4 5 (ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= °N-acetylated" 
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(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "phosphorylated" 

5 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "amidated" 

1 0 

(xi) SEOUENCE DESCRIPTION: SEQ ID NO:24: 

Lys Gly Gly Gin Tyr Glu Glu lie Pro lie 
1 5 10 

1 5 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

2 0 (B) TYPE: amino acid 

(C) STRANDEDNESS; 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Tyr Glu Glu lie 
1 
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CLAIMS 

1 . A cell containing and capable of expressing recombinant DNA molecules encoding two fusion 
proteins, 

5 ( a ) a first fusion protein comprising one or more copies of each of a transcription 

activation domain and one member of a phosphopeptide binding pair, 

( b ) a second fusion protein comprising one or more copies of each of a DNA-binding domain 
and the other member of the phosphopeptide binding pair, 

wherein the phosphopeptide binding pair comprises (i) a peptide ligand containing one or 
1 0 more tyrosine residues and (ii) a phosphopeptide binding domain capable of binding to the peptide 
ligand upon phosphorylation of one or more of the peptide ligand's tyrosine residues, and 

wherein the cell further contains a reporter gene which is linked to a DNA sequence to which 
said second fusion protein is capable of binding and which reporter gene encodes a detectable gene 
product which is expressed when said transgene is activated by proximity to a transcriptional 

1 5 activation domain. 

2 . The cell according to claim 1 wherein said phosphopeptide binding pair comprises an SH2 
domain and a ligand therefor, a phosphatase domain and a ligand therefor, or a PTB phosphopeptide 
binding domain and a ligand therefor. 

20 

3. The cell according to claim 1 wherein the recombinant DNA molecules encode 

(a) a first fusion protein which comprises one or more copies of a transcription activation 
domain and a phosphopeptide binding domain, and 

( b ) a second fusion protein which comprises one or more copies of a DNA-binding domain 

2 5 and the peptide ligand capable, upon phosphorylation, of binding to the phosphopeptide binding 

domain. 

4 . The cell according to claim 3 wherein the second fusion protein further comprises a kinase 
domain. 

30 

5 . The cell according to claim 3 wherein the first fusion protein contains two phosphopeptide 
binding domains. 

6 . The cell according to claim 1 wherein one of the fusion proteins further contains a hormone 

3 5 binding domain permitting regulated sequestering of the fusion protein. 

7. The cell according to claim 1, which is mammalian. 

8. The cell according to claim 1, further comprising an endogenous kinase domain. 

4 0 9. A method for identifying the presence in a test composition of an inhibitor of the binding of a 

phosphopeptide binding pair which comprises the steps of: 

( a ) providing cells of claim 4 or 8, which when cultured are capable of producing 
measurable amounts of said detectable gene product, 

( b ) culturing said cells in the presence and absence of the test composition, and 

4 5 ( c ) determining whether production of the detectable gene product was diminished in the 

presence of the test composition. 

10. The method according to claim 9 wherein said cell is a mammalian cell. 
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11. The method according to claim 9, wherein the DNA-binding domain and transcriptional 
activation dornain are denv d from transcriptional activators having separable DNA-binding and 
transcriptional activation domains. u>naing ana 

^ IL„Tnn T^""^ ^""^"'l'"^* f ^ ^ • ^'"^'"9 the transcriptional 

activation domain are selected from the group consisting of transcriptional activators GAL4 GCN4 

1 0 IctiJUl T^°'^ ^'"'f '^'"^ ^' DNA-binding domain and the transcriptional 

1 0 activation domain are from different transcriptional activators. 

14. The method according to claim 9 wherein said phosphopeptide binding pair comprises an SH2 
binding domain and a ligand therefor 

1 5 

biLinrdomains.' '° ' ''''' '^""^^'"^ Phosphopeptide 

7n h ^ ■ ■^^®^'"5^'°'^/'=<=°'-ding to claim 9 wherein one of the fusion proteins further contains a 

2 0 hormone binding domain permitting regulated sequestering of the fusion protein. 

17^ The method according to claim 9 wherein said test composition is selected from a mixture of 
one or more test peptides, wherein said mixture is provided in the form of a library of synthetic 
peptides or in the form of a phage library displaying the various peptides 



25 



30 



18. The method according to claim 9 wherein said polypeptide ligand is an immunoreceptor 
tyrosine activation motif. ^ 

1 9. The method according to claim 18 wherein said ITAM is the TCR zeta ITAM. 

20. An inhibitor identified by the method of claim 9. 
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